The present invention provides compositions, including
vectors, and methods for the rapid subcloning of nucleic acid
sequences in vivo and in vitro. In particular, the invention
provides vectors used to contain a gene of interest that
comprise a sequence-specific recombinase target site. These
vectors are used to rapidly transfer the gene of interest into
any expression vector that contains a sequence-specific
recombinase target site located downstream of a promoter
element so that the gene of interest may be expressed.
RAPID SUBCLONING USING SITE-SPECIFIC
RECOMBINATION

FIELD OF THE INVENTION

The invention relates to recombinant DNA technology. In
particular, the invention relates to compositions, including
vectors, and methods for the rapid subcloning of nucleic acid
sequences in vivo and in vitro.

BACKGROUND OF THE INVENTION 

Molecular biotechnology has revolutionized the production
of protein and polypeptide compounds of pharmacological
importance. The advent of recombinant DNA technology
permitted for the first time the production of proteins
on a large scale in a recombinant host cell rather than by the
laborious and expensive isolation of the protein from tissues
which may contain minute quantities of the desired protein
(e.g., isolation of human growth hormone from cadaver
pituitary). The production of proteins, including human
proteins, on a large scale in a heterologous host requires the
ability to express the protein of interest in the heterologous
host. This process typically involves isolation or cloning of
the gene encoding the protein of interest followed by transfer
of the coding region into an expression vector which
contains elements (e.g., promoters) which direct the expression
of the desired protein in the heterologous host cell. The
most commonly used means of transferring or subcloning a
coding region into an expression vector involves the in vitro
use of restriction endonucleases and DNA ligases. Restriction
endonucleases are enzymes which generally recognize
and cleave a specific DNA sequence in a double-stranded
DNA molecule. Restriction enzymes are used to excise the
coding region from the cloning vector and the excised DNA
fragment is then joined using DNA ligase to a suitably
cleaved expression vector in such a manner that a functional
protein may be expressed.

The ability to transfer the desired coding region to an
expression vector is often limited by the availability or
suitability of restriction enzyme recognition sites. Often
multiple restriction enzymes must be employed for the
removal of the desired coding region and the reaction
conditions used for each enzyme may differ such that it is
necessary to perform the excision reactions in separate steps.
In addition, it may be necessary to remove a particular
enzyme used in an initial restriction enzyme reaction prior to
completing all restriction enzyme digestions; this requires a
time-consuming purification of the subcloning intermediate.
Ideal methods for the subcloning of DNA molecules would
permit the rapid transfer of the target DNA molecule from
one vector to another in vitro or in vivo without the need to
rely upon restriction enzyme digestions.

SUMMARY OF THE INVENTION

The present invention provides reagents and methods
which comprise a system for the rapid subcloning of nucleic
acid sequences in vivo and in vitro without the need to use
restriction enzymes. Accordingly, the present invention
provides a nucleic acid construct comprising, in operable order:
a) a conditional origin of replication; b) a sequence-specific
recombinase target site having a 5' and a 3' end; and c) a
unique restriction enzyme site, the restriction enzyme site
located adjacent to the 3' end of the sequence-specific
recombinase target site. The present invention is not limited
by the nature of the conditional origin of replication
employed. A variety of conditional origins of replication,
including temperature-sensitive replicons, are known to the
art and may be employed on the nucleic acid constructs of
the present invention. In a preferred embodiment, the
nucleic acid construct further comprises a prokaryotic
termination sequence. The present invention is not limited by
the nature of the prokaryotic termination sequence chosen.
In one embodiment, the prokaryotic termination sequence is
the T7 termination sequence. A variety of termination
sequences are known to the art and may be employed in the
nucleic acid constructs of the present invention including
the TINT, TL1, TL2, TL3, TR1, TR2, T6S termination signals
derived from the bacteriophage lambda [Lambda II, Hendrix
et al. Eds., Cold Spring Harbor Press, Cold Spring Harbor,
N.Y. (1983)] and termination signals derived from bacterial
genes such as the trp gene of E. coli [Stryer, Biochemistry,
2nd ed., W. H. Freeman and Co., San Francisco, Calif.
(1981), p. 610].

In another preferred embodiment, the nucleic acid construct
further comprises a eukaryotic polyadenylation
sequence. The present invention is not limited by the nature
of the eukaryotic polyadenylation sequence chosen. In one
embodiment, the eukaryotic polyadenylation sequence is
selected from the group consisting of the bovine growth
hormone polyadenylation sequence, the simian virus 40
polyadenylation sequence and the Herpes simplex virus
thymidine kinase polyadenylation sequence. In yet another
preferred embodiment, the nucleic acid construct further
comprises a selectable marker gene. The present invention is
not limited by the nature of the selectable marker gene
chosen; the selectable marker may be a positive or negative
selectable marker. In a preferred embodiment, the selectable
marker is selected from the group consisting of the
kanamycin resistance gene, the ampicillin resistance gene, the
tetracycline resistance gene, the chloramphenicol resistance
gene, the streptomycin resistance gene, the strA gene and the
sacB gene.

The present invention is not limited by the nature of the
sequence-specific recombinase target site employed on the
nucleic acid construct. In one embodiment, the
sequence-specific recombinase target site is selected from the group
consisting of loxP, loxP2, loxP3, loxP23, loxP511, loxB,
loxC2, loxL, loxR, loxΔ86, loxΔ117, frt, dif, and att.

In a preferred embodiment, the nucleic acid construct further
comprises a gene of interest inserted into the unique
restriction enzyme site.

In a preferred embodiment, the nucleic acid construct has
the nucleotide sequence set forth in SEQ ID NO:1 (i.e.,
pUNI-10).

The present invention further provides a nucleic acid
construct comprising, in order 5' to 3': a) a conditional origin
of replication; b) a sequence-specific recombinase target
site; c) a polylinker; and d) a selectable marker gene. The
present invention is not limited by the nature of the
polylinker employed. Any collection of two or more
restriction enzyme sites which are unique to the nucleic acid
construct may be employed as the polylinker. The present
invention is not limited by the nature of the conditional
origin of replication employed. A variety of conditional
origins of replication, including temperature-sensitive
replicons, are known to the art and may be employed on the
nucleic acid constructs of the present invention. In a
preferred embodiment, the nucleic acid construct further
comprises a prokaryotic termination sequence. The present
invention is not limited by the nature of the prokaryotic
termination sequence chosen. In one embodiment, the
prokaryotic termination sequence is the T7 termination
sequence. In another preferred embodiment, the nucleic acid
construct further comprises a eukaryotic polyadenylation
sequence. The present invention is not limited by the nature
of the eukaryotic polyadenylation sequence chosen. In one
embodiment, the eukaryotic polyadenylation sequence is
selected from the group consisting of the bovine growth
hormone polyadenylation sequence, the simian virus 40
polyadenylation sequence and the Herpes simplex virus
thymidine kinase polyadenylation sequence. In yet another
preferred embodiment, the nucleic acid construct further
comprises a selectable marker gene. The present invention is
not limited by the nature of the selectable marker gene
chosen; the selectable marker may be a positive or negative
selectable marker. In a preferred embodiment, the selectable
marker is selected from the group consisting of the
kanamycin resistance gene, the ampicillin resistance gene, the
tetracycline resistance gene, the chloramphenicol resistance
gene, the streptomycin resistance gene, the strA gene and the
sacB gene.

The present invention is not limited by the nature of the
sequence-specific recombinase target site employed on the
nucleic acid construct. In one embodiment, the
sequence-specific recombinase target site is selected from the group
consisting of loxP, loxP2, loxP3, loxP23, loxP511, loxB,
loxC2, loxL, loxR, loxΔ86, loxΔ117, frt, dif, and att.



In a preferred embodiment, the nucleic acid construct further
comprises a gene of interest inserted into the polylinker.

The present invention also provides a nucleic acid
construct comprising in operable order a) an origin of
replication; b) a promoter element having a 5' and a 3' end; and c)
a sequence-specific recombinase target site having a 5' and
a 3' end. In a preferred embodiment the 3' end of the
promoter element is located upstream of the 5' end of said
sequence-specific recombinase target site. The present
invention is not limited by the nature of the origin of
replication employed. A variety of non-conditional origins
of replication are known to the art and may be employed on
the nucleic acid constructs of the present invention. The
invention is not limited by the nature of the promoter
element employed. Those skilled in the art know that the
choice of the promoter element depends upon the type of
host cell to be employed for expressing a gene(s) under the
transcriptional control of the chosen promoter element. A
wide variety of promoter elements functional in prokaryotic
(e.g., E. coli) and eukaryotic (e.g., yeast, insect, mammals
including humans) cells are known to the art and may be
employed in the nucleic acid constructs of the present
invention.

In another preferred embodiment, the nucleic acid
construct further comprises a selectable marker gene. The
present invention is not limited by the nature of the
selectable marker gene chosen; the selectable marker may be a
positive or negative selectable marker. In a preferred
embodiment, the selectable marker is selected from the
group consisting of the kanamycin resistance gene, the
ampicillin resistance gene, the tetracycline resistance gene,
the chloramphenicol resistance gene, the streptomycin
resistance gene, the strA gene and the sacB gene. The present
invention is not limited by the nature of the
sequence-specific recombinase target site employed on the nucleic
acid construct. In one embodiment, the sequence-specific
recombinase target site is selected from the group consisting
of loxP, loxP2, loxP3, loxP23, loxP511, loxB, loxC2, loxL,
loxR, loxΔ86, loxΔ117, frt, dif and att.

The present invention further provides a nucleic acid
construct comprising in operable order a) a promoter
element having a 5' and a 3' end; b) a first sequence-specific
recombinase target site having a 5' and a 3' end, wherein the
3' end of the promoter element is located upstream of the 5'
end of the sequence-specific recombinase target site; c) a
gene of interest joined to the 3' end of the sequence-specific
recombinase target site such that a functional translational
reading frame is created; d) a conditional origin of
replication; e) a first selectable marker gene; f) a second
sequence-specific recombinase target site; and g) an origin of
replication. In a preferred embodiment, the first and the second
sequence-specific recombinase target site have the same
nucleotide sequence (e.g., both comprise a loxP site). The
first and the second sequence-specific recombinase target
site need not have the same nucleotide sequence provided
the different sites can recombine with one another. The
present invention is not limited by the nature of either the
conditional origin of replication or the non-conditional
origin of replication employed.

In a preferred embodiment, the nucleic acid construct
further comprises a second selectable marker gene.
Preferably the first and the second selectable marker genes are
different selectable marker genes. The present invention is
not limited by the nature of the selectable marker genes
chosen; the selectable marker genes may be positive or
negative selectable marker genes. As discussed above, the
present invention is not limited by the nature of the promoter
element nor the nature of the sequence-specific recombinase
target sites chosen.

The present invention also provides a method for the in
vitro recombination of nucleic acid constructs, comprising:
a) providing: i) a first nucleic acid construct comprising, in
operable order, a conditional origin of replication, a first
sequence-specific recombinase target site and a first
selectable marker gene; ii) a second nucleic acid construct
comprising, in operable order, an origin of replication, a
promoter element and a second sequence-specific
recombinase target site; and iii) a site-specific recombinase; b)
contacting the first and the second constructs in vitro with
the site-specific recombinase under conditions such that the
first and second constructs are recombined to form a third
nucleic acid construct.

In a preferred embodiment, the first construct employed in
the method further comprises a gene of interest and the
recombination of the first and second constructs places the
gene of interest under the transcriptional control of the
promoter element. In another preferred embodiment, the
second construct further comprises a nucleotide sequence
encoding an affinity domain and the recombination of the
first and second constructs results in placing the gene of
interest in frame with the sequence encoding the affinity
domain. The present invention is not limited by the nature of
the affinity domain employed; a variety of suitable affinity
domains are known to the art including
glutathione-S-transferase, the maltose binding protein, protein A,
polyhistidine tracts, etc.

In a preferred embodiment, the second construct further
comprises a second selectable marker gene, the second
selectable marker being different from the first selectable
marker (present on the first construct). The present invention
is not limited by the nature of the selectable marker genes
chosen; the selectable marker genes may be positive or
negative selectable marker genes. As discussed above, the
present invention is not limited by the nature of the promoter
element nor the nature of the sequence-specific recombinase
target sites chosen.

The present invention further provides a method for the
recombination of nucleic acid constructs in a prokaryotic
host, comprising a) providing: i) a first nucleic acid
construct comprising a conditional origin of replication, a first
sequence-specific recombinase target site having a 5' and a
3' end, a unique restriction enzyme site, the restriction
enzyme site located adjacent to the 3' end of the
sequence-specific recombinase target site, a first selectable marker
gene and a eukaryotic polyadenylation sequence; ii) a
second nucleic acid construct comprising an origin of
replication, a promoter element having a 5' and a 3' end and
a second sequence-specific recombinase target site having a
5' and a 3' end; iii) a prokaryotic host cell expressing a
site-specific recombinase; b) introducing the first and said
second DNA constructs into the prokaryotic host cell under
conditions such that the first and second DNA constructs are
recombined to form a third nucleic acid construct capable of
replicating in the host cell.
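
The recombination of two constructs into a third can be pictured
with a small model. The following is a minimal sketch under stated
assumptions, not the procedure of the invention itself: each circular
construct is written as a list of feature labels, and a single
crossover between one target site on each construct is assumed to
give one larger circle carrying both sites. The feature labels
(ori_cond, lox, gene, marker1, polyA, ori, promoter) and the
function names are hypothetical, and the orientation of the sites
and the reversibility of the reaction are ignored.

```python
from typing import List


def rotate_to(features: List[str], label: str) -> List[str]:
    """Rewrite a circular feature list so that it starts at `label`."""
    i = features.index(label)  # raises ValueError if the label is absent
    return features[i:] + features[:i]


def fuse(first: List[str], second: List[str], site: str = "lox") -> List[str]:
    """Model one crossover between a single `site` on each circle.

    Each circle is opened at its site and the two are joined, giving
    one circle with every feature of both parents and two site copies.
    """
    return rotate_to(first, site) + rotate_to(second, site)


# Hypothetical feature maps, listed 5' to 3' around each circle.
first_construct = ["ori_cond", "lox", "gene", "marker1", "polyA"]
second_construct = ["ori", "promoter", "lox"]

third_construct = fuse(first_construct, second_construct)
print(third_construct)
```

Reading the fused circle past its last element and back to the first,
the promoter is followed by a target site and then by the gene, which
is the arrangement the text relies on for expression.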

In a preferred embodiment, the 3' end of the promoter
element is located upstream of the 5' end of the second
sequence-specific recombinase target site.

In another preferred embodiment, the method further
comprises growing the host cell containing the third
construct under conditions which select for the presence of the
third construct.
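
One way to see how growth under selection can favor the third
construct is the toy model below. It assumes, purely for
illustration, that the host used for selection does not supply the
trans-acting factor required by the conditional origin and that the
medium selects for the first selectable marker; the text above does
not fix either condition. All names are hypothetical.

```python
from typing import Iterable


def host_survives(features: Iterable[str], selects_for: str,
                  host_supplies_factor: bool = False) -> bool:
    """Return True if a host carrying this construct grows under selection.

    The construct must replicate (a normal origin, or a conditional
    origin in a host that supplies its factor) and must carry the
    marker that the medium selects for.
    """
    feats = set(features)
    replicates = "ori" in feats or ("ori_cond" in feats and host_supplies_factor)
    return replicates and selects_for in feats


# Hypothetical feature sets for the two parents and their fusion.
first_construct = ["ori_cond", "lox", "gene", "marker1"]
second_construct = ["ori", "promoter", "lox", "marker2"]
third_construct = first_construct + second_construct

for name, construct in [("first", first_construct),
                        ("second", second_construct),
                        ("third", third_construct)]:
    print(name, host_survives(construct, selects_for="marker1"))
```

Under these assumptions only the fused construct both replicates and
carries the selected marker, so neither parent alone supports growth.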

In a preferred embodiment, the first construct further 
comprises a prokaryotic termination sequence. The present 
invention is not limited by the nature of the prokaryotic 
termination sequence chosen. In one embodiment, the 
prokaryotic termination sequence is the T7 termination 
sequence. 

In another preferred embodiment, the first construct
further comprises a gene of interest inserted into the unique
restriction endonuclease recognition site.

The present invention also provides a fusion protein
comprising protein sequences derived from a
glutathione-S-transferase (Gst) protein and a Cre protein; the Gst-Cre
fusion protein may be provided in a purified form. In a
preferred embodiment, the Gst-Cre fusion protein has the
amino acid sequence of SEQ ID NO:11. In another preferred
embodiment, the Gst-Cre fusion protein is encoded by the
nucleic acid sequence of SEQ ID NO:10.

DESCRIPTION OF THE DRAWINGS 

FIG. 1 provides a schematic illustrating certain elements
of the pUNI vectors and the Univector Fusion System.

FIG. 2A provides a schematic map of the pUNI-10 vector;
the locations of selected restriction enzyme sites are
indicated and unique sites are indicated by the use of bold type.

FIG. 2B shows the DNA sequence of the loxP site and the
polylinkers contained within pUNI-10 (i.e., nucleotides
401-530 of SEQ ID NO:1).

FIG. 3A shows the oligonucleotides (SEQ ID NOS:4 and
5) which were annealed to insert a loxP site into the
polylinker of pGEX-2TKcs to create pgst-lox.

FIG. 3B provides a schematic map of pGEX-2TKcs
which includes an enlargement of the multiple cloning site
(MCS).

FIG. 4A shows the oligonucleotides (SEQ ID NOS:6 and
7) which were annealed to insert a loxP site into the
polylinker of pVL1392 to create pVL1392-lox.

FIG. 4B provides a schematic map of pVL1392 which
includes an enlargement of the multiple cloning site (MCS);
the ampicillin resistance gene (ApR) and the tac promoter
(Ptac) are indicated.

FIG. 5A shows the oligonucleotides (SEQ ID NOS:8 and
9) which were annealed to insert a loxP site into the
polylinker of pGAP24 to create pGAP24-lox.

FIG. 5B provides a schematic map of pGAP24 which
includes an enlargement of the multiple cloning site (MCS);
the ampicillin resistance gene (ApR), the GAP promoter
(PGAP), the origin from the 2 μm circle (2μ) and the TRP1
gene, encoding N-(5'-phosphoribosyl)-anthranilate
synthetase, (TRP1) are indicated.

FIG. 6A shows the oligonucleotides (SEQ ID NOS:8 and
9) which were annealed to insert a loxP site into the
polylinker of pGAL14 to create pGAL14-lox.

FIG. 6B provides a schematic map of pGAL14 which
includes an enlargement of the multiple cloning site (MCS);
the ampicillin resistance gene (ApR), the GAL promoter
(PGAL), the yeast centromeric sequences (CEN), yeast
autonomous replication sequences (ARS) and the TRP1 gene
(TRP1) are indicated.

FIG. 7 shows a Coomassie blue-stained SDS-PAGE gel
showing the purification of Gst-Cre from E. coli cells
containing pQL123.

FIG. 8 provides a schematic showing the strategy
employed for the in vitro recombination of a pUNI vector
("pA," pUNI-5) with a pHOST vector ("pB," pQL103) to
create a fused construct ("pAB"). The relevant markers on
each construct are indicated as are selected restriction
enzyme sites.

FIG. 9A provides a schematic showing the starting
constructs (pUNI-Skp1 and pgst-lox) and the predicted fusion
construct (pGst-Skp1) generated by an in vitro fusion
reaction.

FIG. 9B shows an ethidium bromide-stained gel
showing the separation of restriction fragments generated by the
digestion of pUNI-Skp1, pgst-lox and pGst-Skp1.

FIG. 10A shows a Coomassie blue-stained SDS-PAGE
gel showing the expression of the Gst-Skp1 protein from E.
coli cells containing pGst-Skp1.

FIG. 10B shows a Western blot of an SDS-PAGE gel
containing extracts prepared from E. coli cells containing
pGst-Skp1 which was probed using an anti-Skp1 antibody.

FIG. 11 shows a Western blot of an SDS-PAGE gel
containing extracts prepared from E. coli cells (QLB4)
containing either a conventionally constructed Gst-Skp1
plasmid or pGst-Skp1 (produced by an in vitro fusion
reaction).

FIG. 12 provides a schematic illustrating the in vivo gene
trap method for the recombination of lox-containing vectors
in a host cell constitutively expressing the Cre protein.

FIG. 13 provides the nucleotide sequence of the wild-type
loxP site (SEQ ID NO:12), the loxP2 site (SEQ ID NO:13),
the loxP3 site (SEQ ID NO:14) and the loxP23 site (SEQ ID
DEFINITIONS

To facilitate understanding of the invention, a number of 
terms are defined below. 

As used herein, "a conditional origin of replication" refers
to an origin of replication that requires the presence of a
functional trans-acting factor (e.g., a replication factor) in a
prokaryotic host cell. Conditional origins of replication
include temperature-sensitive replicons such as rep
pSC101ts.

As used herein, the term "origin of replication" refers to
an origin of replication that is functional in a broad range of
prokaryotic host cells (i.e., a normal or non-conditional
origin of replication such as the ColE1 origin and its
derivatives).

The terms "sequence-specific recombinase" and
"site-specific recombinase" refer to enzymes that recognize and
bind to a short nucleic acid site or sequence and catalyze the
recombination of nucleic acid in relation to these sites.

The terms "sequence-specific recombinase target site"
and "site-specific recombinase target site" refer to a short
nucleic acid site or sequence which is recognized by a
sequence- or site-specific recombinase and which becomes
the crossover region during the site-specific recombination
event. Examples of sequence-specific recombinase target
sites include, but are not limited to, lox sites, frt sites, att
sites and dif sites.

The term "lox site" as used herein refers to a nucleotide
sequence at which the product of the cre gene of
bacteriophage P1, Cre recombinase, can catalyze a site-specific
recombination. A variety of lox sites are known to the art including
the naturally occurring loxP (the sequence found in the P1
genome), loxB, loxL and loxR (these are found in the E. coli
chromosome) as well as a number of mutant or variant lox
sites such as loxP511, loxΔ86, loxΔ117, loxC2, loxP2, loxP3
and loxP23.
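
The layout of a lox site can be checked with a few lines of code.
The sketch below is illustrative only: the 34 bp sequence is the
widely published wild-type loxP sequence, supplied here from general
knowledge and not checked against the sequence listing or FIG. 13,
and the function names are hypothetical. It tests that the two 13 bp
arms are inverted repeats flanking an 8 bp spacer.

```python
# Widely published wild-type loxP sequence (assumption: not copied
# from the sequence listing of this document).
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def split_lox(site: str, arm: int = 13, spacer: int = 8):
    """Split a lox site into (left arm, spacer, right arm)."""
    if len(site) != 2 * arm + spacer:
        raise ValueError("unexpected lox site length")
    return site[:arm], site[arm:arm + spacer], site[arm + spacer:]


def has_inverted_repeats(site: str) -> bool:
    """True if the left arm equals the reverse complement of the right arm."""
    left, _, right = split_lox(site)
    return left == reverse_complement(right)


left, spacer, right = split_lox(LOXP)
print(left, spacer, right, has_inverted_repeats(LOXP))
```

The spacer is not its own reverse complement, which is what gives
the site a direction.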

The term "frt site" as used herein refers to a nucleotide
sequence at which the product of the FLP gene of the yeast
2 μm plasmid, FLP recombinase, can catalyze a site-specific
recombination.

The term "unique restriction enzyme site" indicates that
the recognition sequence for a given restriction enzyme
appears once within a nucleic acid molecule. For example,
the EcoRI site is a unique restriction enzyme site within the
plasmid pUNI-10 (SEQ ID NO:1).
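
Whether a site is unique can be tested by counting occurrences of
its recognition sequence. The following is a minimal sketch: the
recognition sequences of EcoRI (GAATTC) and BamHI (GGATCC) are
standard, but the short plasmid sequences are invented for
illustration and are not taken from pUNI-10. Because both sequences
are palindromic, scanning one strand is enough; the circular case
also checks matches that span the arbitrary start of the sequence.

```python
def count_sites(plasmid: str, site: str, circular: bool = True) -> int:
    """Count occurrences of a palindromic recognition sequence."""
    seq = plasmid.upper()
    site = site.upper()
    if circular:
        # Append the first len(site) - 1 bases so that a site spanning
        # the junction of the circular molecule is also found.
        seq += seq[: len(site) - 1]
    return sum(seq.startswith(site, i) for i in range(len(seq) - len(site) + 1))


def is_unique(plasmid: str, site: str, circular: bool = True) -> bool:
    """True if the recognition sequence appears exactly once."""
    return count_sites(plasmid, site, circular) == 1


ECORI = "GAATTC"
BAMHI = "GGATCC"

# Invented toy sequences, for illustration only.
toy_plasmid = "TTGAATTCAAGGATCCTTGGATCCAA"
wrap_plasmid = "TTCAAAAGAA"  # GAATTC spans the junction when circular

print(is_unique(toy_plasmid, ECORI), count_sites(toy_plasmid, BAMHI))
print(count_sites(wrap_plasmid, ECORI), count_sites(wrap_plasmid, ECORI, circular=False))
```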

A restriction enzyme site is said to be located "adjacent to
the 3' end of a sequence-specific recombinase target site" if
the restriction enzyme recognition site is located
downstream of the 3' end of the sequence-specific recombinase
target site. The adjacent restriction enzyme site may, but
need not, be contiguous with the last or 3' nucleotide
comprising the sequence-specific recombinase target site.
For example, the EcoRI site of pUNI-10 is located adjacent
(within 3 nucleotides) to the 3' end of the loxP site (see FIG.
2B); the XhoI, NdeI, NcoI sites are also adjacent (i.e., within
about 10-150 nucleotides) to the loxP site but these sites are
not contiguous with the 3' end of the loxP site in pUNI-10.
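
The notion of adjacency can be made concrete by measuring how many
nucleotides separate the 3' end of the target site from the start of
a downstream recognition sequence. The sketch below is illustrative
only: the construct is an invented linear sequence, not pUNI-10, the
loxP sequence is the widely published wild-type sequence, and the
function name is hypothetical. A gap of zero corresponds to a
contiguous site.

```python
from typing import Optional

LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"  # widely published wild-type loxP
ECORI = "GAATTC"
XHOI = "CTCGAG"


def gap_after_target(construct: str, target: str, enzyme_site: str) -> Optional[int]:
    """Nucleotides between the 3' end of `target` and the next `enzyme_site`.

    Returns None if no such site lies downstream of the target site.
    """
    start = construct.find(target)
    if start < 0:
        raise ValueError("target site not found")
    three_prime_end = start + len(target)
    hit = construct.find(enzyme_site, three_prime_end)
    return None if hit < 0 else hit - three_prime_end


# Invented linear construct: loxP, a 2 nt gap, EcoRI, a spacer, XhoI.
toy_construct = "GG" + LOXP + "AC" + ECORI + "T" * 10 + XHOI + "AA"

print(gap_after_target(toy_construct, LOXP, ECORI))
print(gap_after_target(toy_construct, LOXP, XHOI))
```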

The terms "polylinker" or "multiple cloning site" refer to
a cluster of restriction enzyme sites on a nucleic acid
construct which are utilized for the insertion and/or excision
of nucleic acid sequences such as the coding region of a
gene, lox sites, etc.

The term "prokaryotic termination sequence" refers to a
nucleic acid sequence which is recognized by the RNA
polymerase of a prokaryotic host cell and results in the
termination of transcription. Prokaryotic termination
sequences commonly comprise a GC-rich region that has a
twofold symmetry followed by an AT-rich sequence [Stryer,
supra]. A commonly used prokaryotic termination sequence
is the T7 termination sequence. A variety of termination
sequences are known to the art and may be employed in the
nucleic acid constructs of the present invention including
the TINT, TL1, TL2, TL3, TR1, TR2, T6S termination signals
derived from the bacteriophage lambda [Lambda II, Hendrix
et al. Eds., supra] and termination signals derived from
bacterial genes such as the trp gene of E. coli [Stryer, supra].

The term "eukaryotic polyadenylation sequence" (also
referred to as a "poly A site" or "poly A sequence") as used
herein denotes a DNA sequence which directs both the
termination and polyadenylation of the nascent RNA
transcript. Efficient polyadenylation of the recombinant
transcript is desirable as transcripts lacking a poly A tail are
unstable and are rapidly degraded. The poly A signal utilized
in an expression vector may be "heterologous" or
"endogenous." An endogenous poly A signal is one that is found
naturally at the 3' end of the coding region of a given gene
in the genome. A heterologous poly A signal is one which is
isolated from one gene and placed 3' of another
gene. A commonly used heterologous poly A signal is the
SV40 poly A signal. The SV40 poly A signal is contained on
a 237 bp BamHI/BclI restriction fragment and directs both
termination and polyadenylation [J. Sambrook, supra, at
16.6-16.7]; numerous vectors contain the SV40 poly A
signal [e.g., pCEP4, pREP4, pEBVHis (Invitrogen)].
Another commonly used heterologous poly A signal is
derived from the bovine growth hormone (BGH) gene; the
BGH poly A signal is available on a number of commercially
available vectors [e.g., pcDNA3.1, pZeoSV2, pSecTag
(Invitrogen)]. The poly A signal from the Herpes simplex
virus thymidine kinase (HSV tk) gene is also used as a poly
A signal on expression vectors; vectors containing the HSV
tk poly A signal include the pBK-CMV, pBK-RSV,
pOP13CAT vectors from Stratagene.

As used herein, the terms "selectable marker" or
"selectable marker gene" refer to the use of a gene which encodes
an enzymatic activity that confers the ability to grow in
medium lacking what would otherwise be an essential
nutrient (e.g., the TRP1 gene in yeast cells); in addition, a
selectable marker may confer resistance to an antibiotic or
drug upon the cell in which the selectable marker is
expressed. A selectable marker may be used to confer a
particular phenotype upon a host cell. When a host cell must
express a selectable marker to grow in selective medium, the
marker is said to be a positive selectable marker (e.g.,
antibiotic resistance genes which confer the ability to grow
in the presence of the appropriate antibiotic). Selectable
markers can also be used to select against host cells
containing a particular gene (e.g., the sacB gene which, if
expressed, kills the bacterial host cells grown in medium
containing 5% sucrose); selectable markers used in this
manner are referred to as negative selectable markers or
counter-selectable markers.

As used herein, the term "vector" is used in reference to
nucleic acid molecules that transfer DNA segment(s) from
one cell to another. The term "vehicle" is sometimes used
interchangeably with "vector." A "vector" is a type of
"nucleic acid construct." The term "nucleic acid construct"
includes circular nucleic acid constructs such as plasmid
constructs, cosmid vectors, etc. as well
as linear nucleic acid constructs (e.g., λ phage constructs,
PCR products). The nucleic acid construct may comprise
expression signals such as a promoter and/or an enhancer (in
such a case it is referred to as an expression vector).

The term "expression vector" as used herein refers to a
recombinant DNA molecule containing a desired coding
sequence and appropriate nucleic acid sequences necessary
for the expression of the operably linked coding sequence in
a particular host organism. Nucleic acid sequences
necessary for expression in procaryotes usually include a
promoter, an operator (optional), and a ribosome binding
site, often along with other sequences. Eucaryotic cells are
known to utilize promoters, enhancers, and termination and
polyadenylation signals.

The terms "in operable combination", "in operable order"
and "operably linked" as used herein refer to the linkage of
nucleic acid sequences in such a manner that a nucleic acid
molecule capable of directing the transcription of a given
gene and/or the synthesis of a desired protein molecule is
produced. The term also refers to the linkage of amino acid
sequences in such a manner so that a functional protein is
produced.

The terms "transformation" and "transfection" as used
herein refer to the introduction of foreign DNA into
prokaryotic or eucaryotic cells. Transformation of prokaryotic cells
may be accomplished by a variety of means known to the art
including the treatment of host cells with CaCl2 to make
competent cells, electroporation, etc. Transfection of
eukaryotic cells may be accomplished by a variety of means
known to the art including calcium phosphate-DNA
co-precipitation, DEAE-dextran-mediated transfection,
polybrene-mediated transfection, electroporation,
microinjection, liposome fusion, lipofection, protoplast
fusion, retroviral infection, and biolistics.

As used herein, the terms "restriction endonucleases" and
"restriction enzymes" refer to bacterial enzymes, each of
which cuts double-stranded DNA at or near a specific
nucleotide sequence.

As used herein, the term "recombinant DNA molecule"
refers to a DNA molecule which is comprised of
segments of DNA joined together by means of molecular
biological techniques.

DNA molecules are said to have "5' ends" and "3' ends"
because mononucleotides are reacted to make
oligonucleotides in a manner such that the 5' phosphate of one
mononucleotide pentose ring is attached to the 3' oxygen of
its neighbor in one direction via a phosphodiester linkage.
Therefore, an end of an oligonucleotide is referred to as the
"5' end" if its 5' phosphate is not linked to the 3' oxygen of
a mononucleotide pentose ring and as the "3' end" if its 3'
oxygen is not linked to a 5' phosphate of a subsequent
mononucleotide pentose ring. As used herein, a nucleic acid
sequence, even if internal to a larger oligonucleotide, also
may be said to have 5' and 3' ends. In either a linear or
circular DNA molecule, discrete elements are referred to as
being "upstream" or 5' of the "downstream" or 3' elements.
This terminology reflects the fact that transcription proceeds
in a 5' to 3' fashion along the DNA strand. The promoter and
enhancer elements which direct transcription of a linked
gene are generally located 5' or upstream of the coding
region. However, enhancer elements can exert their effect
even when located 3' of the promoter element and the coding
region. Transcription termination and polyadenylation
signals are located 3' or downstream of the coding region.

The 3' end of a promoter element is said to be located
upstream of the 5' end of a sequence-specific recombinase
target site when (moving in a 5' to 3' direction along the
nucleic acid molecule) the 3' terminus of a promoter element
(the transcription start site is taken as the 3' end of a
promoter element) precedes the 5' end of the
sequence-specific recombinase target site. The 3' end of the
promoter element may be located adjacent (generally within
about 0 to 500 bp) to the 5' end of the sequence-specific
recombinase target site; such an arrangement is used when the
pHOST vector is not intended to permit the expression of a
translational fusion with the gene of interest donated by a
pUNI vector. Alternatively, when the pHOST vector is intended
to permit the expression of a translational fusion, the 3' end
of the promoter element is located upstream of both the
sequences encoding the amino-terminus of a fusion protein
and the 5' end of the sequence-specific recombinase target
site; in this case, the 5' end of the sequence-specific
recombinase target site is located within the coding region of
the fusion protein (e.g., located downstream of both the
promoter element and the sequences encoding the affinity
domain, such as Gst).

As used herein, the term "an oligonucleotide having a
nucleotide sequence encoding a gene" means a nucleic acid
sequence comprising the coding region of a gene or in other
words the nucleic acid sequence which encodes a gene
product. The coding region may be present in either a
cDNA, genomic DNA or RNA form. When present in a
DNA form, the oligonucleotide may be single-stranded (i.e.,
the sense strand) or double-stranded. Suitable control
elements such as enhancers/promoters, splice junctions,
polyadenylation signals, etc. may be placed in close proximity
to the coding region of the gene if needed to permit proper
initiation of transcription and/or correct processing of the
primary RNA transcript. Alternatively, the coding region
utilized in the vectors of the present invention may contain
endogenous enhancers/promoters, splice junctions,
intervening sequences, polyadenylation signals, etc. or a
combination of both endogenous and exogenous control elements.

As used herein, the term "regulatory element" refers to a
genetic element which controls some aspect of the
expression of nucleic acid sequences. For example, a promoter
is a regulatory element which facilitates the initiation of
transcription of an operably linked coding region. Other
regulatory elements are splicing signals, polyadenylation
signals, termination signals, etc. (defined infra).

Transcriptional control signals in eukaryotes comprise
"promoter" and "enhancer" elements. Promoters and
enhancers consist of short arrays of DNA sequences that
interact specifically with cellular proteins involved in
transcription [Maniatis, T. et al., Science 236:1237 (1987)].
Promoter and enhancer elements have been isolated from a
variety of eucaryotic sources including genes in yeast, insect
and mammalian cells and viruses (analogous control
elements, i.e., promoters, are also found in procaryotes). The
selection of a particular promoter and enhancer depends on
what cell type is to be used to express the protein of interest.
Some eucaryotic promoters and enhancers have a broad host
range while others are functional in a limited subset of cell
types [for review see Voss, S. D. et al., Trends Biochem. Sci.
11:287 (1986) and Maniatis, T. et al., supra (1987)]. For
example, the SV40 early gene enhancer is very active in a
wide variety of cell types from many mammalian species
and has been widely used for the expression of proteins in
mammalian cells [Dijkema, R. et al., EMBO J. 4:761 (1985)].
Two other examples of promoter/enhancer elements active
in a broad range of mammalian cell types are those from the
human elongation factor 1α gene [Uetsuki, T. et al., J. Biol.
Chem. 264:5791 (1989), Kim, D. W. et al., Gene 91:217
(1990) and Mizushima, S. and Nagata, S., Nuc. Acids. Res.
18:5322 (1990)] and the long terminal repeats of the Rous
sarcoma virus [Gorman, C. M. et al., Proc. Natl. Acad. Sci.
USA 79:6777 (1982)] and the human cytomegalovirus
[Boshart, M. et al., Cell 41:521 (1985)].

As used herein, the term "promoter/enhancer" denotes a
segment of DNA which contains sequences capable of
providing both promoter and enhancer functions (i.e., the
functions provided by a promoter element and an enhancer
element, see above for a discussion of these functions). For
example, the long terminal repeats of retroviruses contain
both promoter and enhancer functions. The
enhancer/promoter may be "endogenous" or "exogenous" or
"heterologous." An "endogenous" enhancer/promoter is one
which is naturally linked with a given gene in the genome. An
"exogenous" or "heterologous" enhancer/promoter is one
which is placed in juxtaposition to a gene by means of
genetic manipulation (i.e., molecular biological techniques)
such that transcription of that gene is directed by the linked
enhancer/promoter.

The presence of "splicing signals" on an expression
vector often results in higher levels of expression of the
recombinant transcript. Splicing signals mediate the removal
of introns from the primary RNA transcript and consist of a
splice donor and acceptor site [Sambrook, J. et al.,
Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring
Harbor Laboratory Press, New York (1989) pp. 16.7-16.8].
A commonly used splice donor and acceptor site is the splice
junction from the 16S RNA of SV40.

Efficient expression of recombinant DNA sequences in
eucaryotic cells requires expression of signals directing the
efficient termination and polyadenylation of the resulting
transcript. Transcription termination signals are generally
found downstream of the polyadenylation signal and are a
few hundred nucleotides in length. The term "poly A site" or
"poly A sequence" as used herein denotes a DNA sequence
which directs both the termination and polyadenylation of
the nascent RNA transcript. Efficient polyadenylation of the
recombinant transcript is desirable as transcripts lacking a
poly A tail are unstable and are rapidly degraded. The poly
A signal utilized in an expression vector may be
"heterologous" or "endogenous." An endogenous poly A signal
is one that is found naturally at the 3' end of the coding
region of a given gene in the genome. A heterologous poly A
signal is one which is isolated from one gene and placed
3' of another gene.

Eucaryotic expression vectors may also contain "viral
replicons" or "viral origins of replication." Viral replicons
are viral DNA sequences which allow for the
extrachromosomal replication of a vector in a host cell
expressing the appropriate replication factors. Vectors which
contain either the SV40 or polyoma virus origin of replication
replicate to high copy number (up to 10^4 copies/cell) in cells
that express the appropriate viral T antigen. Vectors which
contain the replicons from bovine papillomavirus or
Epstein-Barr virus replicate extrachromosomally at low copy
number (~100 copies/cell).

As used herein, the terms "nucleic acid molecule
encoding," "DNA sequence encoding," and "DNA encoding"
refer to the order or sequence of deoxyribonucleotides
along a strand of deoxyribonucleic acid. The order of these
deoxyribonucleotides determines the order of amino acids
along the polypeptide (protein) chain. The DNA sequence
thus codes for the amino acid sequence.

As used herein, the term "gene" means the
deoxyribonucleotide sequences comprising the coding region of
a structural gene, including sequences located adjacent
to the coding region on both the 5' and 3' ends for a distance
of about 1 kb on either end such that the gene corresponds
to the length of the full-length mRNA. The sequences which
are located 5' of the coding region and which are present on
the mRNA are referred to as 5' non-translated sequences.
The sequences which are located 3' or downstream of the
coding region and which are present on the mRNA are
referred to as 3' non-translated sequences.
The term "gene" encompasses both cDNA and genomic
forms of a gene. A genomic form or clone of a gene contains
the coding region interrupted with non-coding sequences
termed "introns" or "intervening regions" or "intervening
sequences." Introns are segments of a gene which are
transcribed into nuclear RNA (hnRNA); introns may contain
regulatory elements such as enhancers. Introns are removed
or "spliced out" from the nuclear or primary transcript;
introns therefore are absent in the messenger RNA (mRNA)
transcript. The mRNA functions during translation to
specify the sequence or order of amino acids in a nascent
polypeptide.

In addition to containing introns, genomic forms of a gene
may also include sequences located on both the 5' and 3' end
of the sequences which are present on the RNA transcript.
These sequences are referred to as "flanking" sequences or
regions (these flanking sequences are located 5' or 3' to the
non-translated sequences present on the mRNA transcript).
The 5' flanking region may contain regulatory sequences
such as promoters and enhancers which control or influence
the transcription of the gene. The 3' flanking region may
contain sequences which direct the termination of
transcription, post-transcriptional cleavage and
polyadenylation.

As used herein, the term "purified" or "to purify" refers to
the removal of contaminants from a sample. For example,
recombinant Cre polypeptides are expressed in bacterial host
cells (as a Gst-Cre fusion protein) and the Cre polypeptides
are purified by the removal of host cell proteins; the percent
of recombinant Cre polypeptides is thereby increased in the
sample.

The term "recombinant DNA molecule" as used herein 
refers to a DNA molecule which is comprised of segments 
of DNA joined together by means of molecular biological 
techniques. 

The term "recombinant protein" or "recombinant 
polypeptide" as used herein refers to a protein molecule 
which is expressed from a recombinant DNA molecule. 

The term "native protein" is used herein to indicate that
a protein does not contain amino acid residues encoded by
vector sequences; that is, the native protein contains only
those amino acids found in the protein as it occurs in nature.
A native protein may be produced by recombinant means or
may be isolated from a naturally occurring source.

As used herein, the term "portion" when in reference to a
protein (as in "a portion of a given protein") refers to
fragments of that protein. The fragments may range in size
from four amino acid residues to the entire amino acid
sequence minus one amino acid.

As used herein, the term "fusion protein" refers to a
chimeric protein containing the protein of interest (i.e., the
Cre protein) joined to an exogenous protein fragment (the
fusion partner which consists of non-Cre protein sequences).
The fusion partner may enhance solubility of the protein of
interest as expressed in a host cell, may provide an affinity
tag to allow purification of the recombinant fusion protein
from the host cell or culture supernatant, or both. If desired,
the fusion partner may be removed from the protein of
interest (i.e., the Cre protein) by a variety of enzymatic or
chemical means known to the art.

DESCRIPTION OF THE INVENTION
The present invention provides reagents and methods
which comprise a system for the rapid subcloning of nucleic
acid sequences in vivo and in vitro without the need to use
restriction enzymes. This system is referred to as the
Univector Fusion System. The basis of the Univector Fusion
System is a vector termed the Univector or the pUNI vector
into which sequences encoding a gene of interest (cDNA or
genomic) are inserted. The pUNI vector has a
sequence-specific recombinase target site, such as a loxP site,
preceding the insertion site for the gene of interest, a
selectable marker gene (this feature is optional) and a
conditional origin of replication that is active only in host
cells expressing the requisite trans-acting replication factor.
The pUNI vectors are designed to contain a gene of interest
but lack a promoter for the expression of the gene of interest.
The gene
of interest may be cloned directly into the pUNI vector (i.e.,
the pUNI vector may be used as a cloning vector,
particularly for the cloning of cDNA libraries) or a previously
cloned gene of interest may be inserted (i.e., subcloned) into
the pUNI vector.

Using a sequence-specific recombinase (e.g., Cre
recombinase), a precise fusion of the pUNI vector into a
second vector containing another copy of the
sequence-specific recombinase target site found on the pUNI
vector is catalyzed. The second vector, referred to generically
as a pHOST vector, is an expression vector that contains the
sequence-specific recombinase target site downstream of the
promoter element contained within the expression vector.
Following the site-specific recombination event which
occurs between the single sequence-specific recombinase
target sites located on each vector (e.g., the pUNI vector and
the pHOST vector), the two vectors are stably fused in a
manner that places the expression of the gene of interest
under the control of the promoter element contained within
the expression vector. This fusion event also occurs in a
manner that retains the proper translational reading frame of
the gene of interest. This subcloning event occurs without
the need to use restriction enzymes.

The fusion or recombination event can be selected for by
selecting for the ability of host cells, which do not express
the trans-acting replication factor required for replication of
the conditional origin contained on the pUNI vector, to
acquire the selectable phenotype conferred by the selectable
marker gene (if present) on the pUNI vector. The pUNI
vector cannot replicate in cells that do not express the
trans-acting replication factor and therefore, unless the
pUNI vector has integrated into the second vector that
contains a non-conditional origin of replication, pUNI will
be lost from the host cell.

The Univector Fusion System allows any number of
expression constructs containing the gene of interest present
on the pUNI vector to be made rapidly (e.g., within a single
day). Using conventional cloning or subcloning techniques
(which employ restriction enzyme digestions), the
production of a single expression vector containing a gene of
interest can take several days (for the design and
construction of each expression vector). In contrast, once a
battery of expression vectors modified to contain the
appropriate sequence-specific recombinase target site is made,
a gene of interest can be transferred to any number of
expression vectors in an afternoon using the Univector Fusion
System. FIG. 1 provides a schematic illustrating certain
elements of the pUNI vectors and the Univector Fusion System.

a) Conditional Origins of Replication and Suitable Host Cells
Conditional origins of replication are origins which
require the presence or expression of a trans-acting factor in
the host cell for replication. A variety of conditional origins
of replication functional in prokaryotic hosts (e.g., E. coli)
are known to the art. The present invention is illustrated but
not limited by the use of the R6Kγ origin, oriR, from the
plasmid R6K. The R6Kγ origin requires a trans-acting
factor, the Π protein supplied by the pir gene [Metcalf et al.
(1996) Plasmid 35:1]. E. coli strains containing the pir gene
will support replication of R6Kγ origins to medium copy
number. A strain containing a mutant allele of pir, pir-116,
will allow an even higher copy number of constructs
containing the R6Kγ origin.

E. coli strains that express the pir or pir-116 gene product
include BW18815 (ATCC 47079; this strain contains the
pir-116 gene), BW19094 (ATCC 47080; this strain contains
the pir gene), BW20978 (this strain contains the pir-116
gene), BW20979 (this strain contains the pir gene),
BW21037 (this strain contains the pir-116 gene) and
BW21038 (this strain contains the pir gene) (Metcalf et al.,
supra).

Other conditional origins of replication suitable for use on
the pUNI vectors of the present invention include:

1) the RK2 oriV, which requires a trans-acting protein
encoded by the trfA gene [Ayres et al. (1993) J. Mol. Biol.
230:174];

2) the bacteriophage P1 ori, which requires the repA
protein for replication [Pal et al. (1986) J. Mol. Biol.
192:275];

3) the origin of replication of the plasmid pSC101 (ATCC
37032), which requires a plasmid-encoded protein, repA, for
replication [Sugiura et al. (1992) J. Bacteriol. 175:5993].
The pSC101 ori also requires IHF, an E. coli protein. E. coli
strains carrying the himA and himD (hip) mutants (the him
genes encode subunits of IHF) cannot support
pSC101 replication [Stenzel et al. (1987) Cell 49:709];

4) the bacteriophage lambda ori, which requires the
lambda O and P proteins [Lambda II, Hendrix et al. Eds.,
Cold Spring Harbor Press, Cold Spring Harbor, N.Y.];

5) pBR322 and other ColE1 derivatives, which will not
replicate in polA mutants of E. coli; these origins of
replication can therefore be used in a conditional manner
[Grindley and Kelley (1976) Mol. Gen. Genet. 143:311]; and

6) replication-thermosensitive plasmids such as pSU739 or
pSU300, which contain a thermosensitive replicon derived
from plasmid pSC101, rep pSC101ts, which comprises oriV
[Mendiola and de la Cruz (1989) Mol. Microbiol. 3:979 and
Francis and Lobo (1996) J. Bact. 178:894]. pSU739 and
pSU300 are stably maintained in E. coli strain DH5α (Gibco
BRL) at a growth temperature of 30° C. (42° C. is
non-permissive for replication of this replicon).

Other conditional origins of replication, including other
temperature sensitive replicons, are known to the art and
may be employed in the vectors and methods of the present
invention.

b) Sequence-Specific Recombinases And Target Recognition Sites
The precise fusion between the pUNI vector and the pHOST
vector is catalyzed by a site-specific recombinase.
Site-specific recombinases are enzymes that recognize
a specific DNA site or sequence (referred to herein
generically as a "sequence-specific recombinase target site")
and catalyze the recombination of DNA in relation to these
sites. Site-specific recombinases are employed for the
recombination of DNA in both prokaryotes and eukaryotes.
Examples of site-specific recombination include 1)
chromosomal rearrangements which occur in Salmonella
typhimurium during phase variation, inversion of the FLP
sequence during the replication of the yeast 2 μm circle and
in the rearrangement of immunoglobulin and T cell receptor
genes in vertebrates, 2) integration of bacteriophages into the
chromosome of prokaryotic host cells to form a lysogen and
3) transposition of mobile genetic elements (e.g.,
transposons) in both prokaryotes and eukaryotes. The term
"site-specific recombinase" refers to enzymes that recognize
short DNA sequences that become the crossover regions
during the recombination event and includes
transposases and integrases.

The present invention is illustrated but not limited by the
use of vectors containing loxP sites and the recombination of
these vectors using the Cre recombinase of bacteriophage P1.
The Cre protein catalyzes recombination of DNA between
two loxP sites [Sternberg et al. (1981) Cold Spring Harbor
Symp. Quant. Biol. 45:297]. The loxP sites may be present
on the same DNA molecule or they may be present on
different DNA molecules; the DNA molecules may be linear
or circular or a combination of both. The loxP site consists
of a double-stranded 34 bp sequence (SEQ ID NO:12) which
comprises two 13 bp inverted repeat sequences separated by
an 8 bp spacer region [Hoess et al. (1982) Proc. Natl. Acad.
Sci. USA 79:3398 and U.S. Pat. No. 4,959,317, the
disclosure of which is herein incorporated by reference]. The
internal spacer sequence of the loxP site is asymmetrical and
thus, two loxP sites can exhibit directionality relative to one
another [Hoess et al. (1984) Proc. Natl. Acad. Sci. USA
81:1026]. When two loxP sites on the same DNA molecule
are in a directly repeated orientation, Cre excises the DNA
between these two sites leaving a single loxP site on the
DNA molecule [Abremski et al. (1983) Cell 32:1301]. If two
loxP sites are in opposite orientation on a single DNA
molecule, Cre inverts the DNA sequence between these two
sites rather than removing the sequence. Two circular DNA
molecules each containing a single loxP site will recombine
with one another to form a mixture of monomer, dimer, trimer,
etc. circles. The concentration of the DNA circles in the
reaction can be used to favor the formation of monomer
(lower concentration) or multimeric circles (higher
concentration).
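
The orientation rule described above (directly repeated sites give excision, oppositely oriented sites give inversion) can be illustrated with a short sketch. It is only a toy model under stated assumptions: the `LOXP` string is the commonly cited 34 bp loxP sequence, assumed here to correspond to the site described in the text, and the function names are hypothetical helpers introduced for illustration. It performs exact string matching on a linear sequence and does not model reaction efficiency or DNA concentration.

```python
from typing import List, Tuple

# Commonly cited 34 bp loxP sequence (an assumption of this sketch):
# 13 bp repeat + asymmetric 8 bp spacer + 13 bp inverted repeat.
LOXP = "ATAACTTCGTATA" + "ATGTATGC" + "TATACGAAGTTAT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_lox_sites(seq: str, site: str = LOXP) -> List[Tuple[int, str]]:
    """Return (position, orientation) for every exact copy of the site.

    '+' means the site reads as written; '-' means its reverse complement
    is present, i.e. the site lies in the opposite orientation.
    """
    seq = seq.upper()
    flipped = reverse_complement(site)
    hits = []
    for i in range(len(seq) - len(site) + 1):
        window = seq[i:i + len(site)]
        if window == site:
            hits.append((i, "+"))
        elif window == flipped:
            hits.append((i, "-"))
    return hits


def predict_outcome(seq: str) -> str:
    """Classify the Cre product for a molecule carrying exactly two sites."""
    sites = find_lox_sites(seq)
    if len(sites) != 2:
        raise ValueError("expected exactly two loxP sites")
    return "excision" if sites[0][1] == sites[1][1] else "inversion"


def excise(seq: str) -> Tuple[str, str]:
    """For directly repeated sites, return (remaining molecule, excised circle).

    Each product keeps one loxP site; the circle is written as a linear
    string opened at its loxP site.
    """
    if predict_outcome(seq) != "excision":
        raise ValueError("sites are not directly repeated")
    (first, _), (second, _) = find_lox_sites(seq)
    seq = seq.upper()
    return seq[:first] + seq[second:], seq[first:second]
```

Because the spacer is asymmetric, the reverse complement of the site differs from the site itself, which is what lets the sketch assign an orientation to each copy.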

Circular DNA molecules having a single loxP site will
recombine with a linear molecule having a single loxP site
to produce a larger linear molecule. Cre interacts with a
linear molecule containing two directly repeating loxP sites
to produce a circle containing the sequences between the
loxP sites and a single loxP site and a linear molecule
containing a single loxP site at the site of the deletion.

The Cre protein has been purified to homogeneity
[Abremski et al. (1984) J. Mol. Biol. 259:1509] and the cre
gene has been cloned and expressed in a variety of host cells
[Abremski et al. (1983), supra]. Purified Cre protein is
available from a number of suppliers (e.g., Novagen and
New England Nuclear/Du Pont).

The Cre protein also recognizes a number of variant or
mutant lox sites (variant relative to the loxP sequence),
including the loxB, loxL, and loxR sites which are found in
the E. coli chromosome [Hoess et al. (1982), supra]. Other
variant lox sites include loxP511
[5'-ATAACTTCGTATA GTATACAT TATACGAAGTTAT-3' (SEQ ID NO:16);
spacer region set off by spaces; Hoess et al. (1986), supra]
and loxC2
[5'-ACAACTTCGTATA ATGTATGC TATACGAAGTTAT-3' (SEQ ID NO:17);
spacer region set off by spaces; U.S. Pat. No. 4,959,317]. Cre
catalyzes the cleavage of the lox site within the spacer region
and creates a six base-pair staggered cut [Hoess and
Abremski (1985) J. Mol. Biol. 181:351]. The two 13 bp inverted
repeat domains of the lox site represent binding sites for the
Cre protein. If two lox sites differ in their spacer regions in
such a manner that the overhanging ends of the cleaved
DNA cannot reanneal with one another, Cre cannot
efficiently catalyze a recombination event using the two
different lox sites. For example, it has been reported that Cre
cannot recombine (at least not efficiently) a loxP site and a
loxP511 site; these two lox sites differ in the spacer region.
Two lox sites which differ due to variations in the binding
sites (i.e., the 13 bp inverted repeats) may be recombined by
Cre provided that Cre can bind to each of the variant binding
sites; the reaction between two different lox sites (varying
in the binding sites) may be less efficient than that
between two lox sites having the same sequence (the
efficiency will depend on the degree and the location of the
variations in the binding sites). For example, the loxC2 site
can be efficiently recombined with the loxP site; these two
lox sites differ by a single nucleotide in the left binding site.
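
The distinction drawn above between spacer differences and binding-site differences can be sketched as a simple comparison. This is a minimal illustration, not a predictor of recombination efficiency: it assumes that identical spacers stand in for "overhanging ends that can reanneal", it uses the commonly cited loxP sequence, and it takes the loxP511 and loxC2 sequences as read from the text above. The helper names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Layout of a 34 bp lox site: 13 bp arm, 8 bp spacer, 13 bp arm.
LEFT_ARM, SPACER, RIGHT_ARM = slice(0, 13), slice(13, 21), slice(21, 34)

# Sequences are assumptions of this sketch (see the lead-in).
LOX_SITES = {
    "loxP": "ATAACTTCGTATA" + "ATGTATGC" + "TATACGAAGTTAT",
    "loxP511": "ATAACTTCGTATA" + "GTATACAT" + "TATACGAAGTTAT",
    "loxC2": "ACAACTTCGTATA" + "ATGTATGC" + "TATACGAAGTTAT",
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def spacers_match(site_a: str, site_b: str) -> bool:
    """True if the two 8 bp spacers are identical in either orientation.

    A site written on the opposite strand has a reverse-complemented
    spacer, so both readings are compared.
    """
    a, b = site_a[SPACER], site_b[SPACER]
    return a == b or a == reverse_complement(b)


def arm_mismatches(site_a: str, site_b: str) -> int:
    """Count differing positions in the two 13 bp binding arms.

    Assumes both sites are written in the same orientation.
    """
    return sum(
        x != y
        for region in (LEFT_ARM, RIGHT_ARM)
        for x, y in zip(site_a[region], site_b[region])
    )
```

Under these assumptions, loxP and loxC2 share a spacer and differ at one arm position, whereas loxP and loxP511 have identical arms but different spacers, matching the two cases described in the text.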

A variety of other site-specific recombinases may be
employed in the methods of the present invention in place of
the Cre recombinase. Alternative site-specific recombinases
include:

1) the FLP recombinase of the 2μ plasmid of Saccharomyces
cerevisiae [Cox (1983) Proc. Natl. Acad. Sci. USA
80:4223], which recognizes the frt site which, like the loxP
site, comprises two 13 bp inverted repeats separated by an
8 bp spacer [5'-GAAGTTCCTATTC TCTAGAAA GTATAGGAACTTC-3'
(SEQ ID NO:18); spacer set off by spaces].
The FLP gene has been cloned and expressed in E. coli (Cox,
supra) and in mammalian cells (PCT International Patent
Application PCT/US92/01899, Publication No.: WO
92/15694, the disclosure of which is herein incorporated by
reference) and has been purified [Meyer-Leon et al. (1987)
Nucleic Acids Res. 15:6469; Babineau et al. (1985) J. Biol.
Chem. 260:12313; Gronostajski and Sadowski (1985) J.
Biol. Chem. 260:12328];

2) the Int recombinase of bacteriophage lambda (with or
without Xis), which recognizes att sites (Weisberg et al. In:
Lambda II, supra, pp. 211-250);

3) the xerC and xerD recombinases of E. coli, which
together form a recombinase that recognizes the 28 bp dif
site [Leslie and Sherratt (1995) EMBO J. 14:1561];

4) the Int protein from the conjugative transposon Tn916
[Lu and Churchward (1994) EMBO J. 13:1541];

5) TpnI and the β-lactamase transposons [Levesque
(1990) J. Bacteriol. 172:3745];

6) the Tn3 resolvase [Flanagan et al. (1989) J. Mol. Biol.
206:295 and Stark et al. (1989) Cell 58:779];

7) the SpoIVC recombinase of Bacillus subtilis [Sato et al.
J. Bacteriol. 172:1092];

8) the Hin recombinase [Glasgow et al. (1989) J. Biol.
Chem. 264:10072];

9) the Cin recombinase [Hafter et al. (1988) EMBO J.
7:3991]; and

10) the immunoglobulin recombinases [Malynn et al. Cell
(1988) 54:453].

c) Modification of Expression Vectors
As discussed above, pUNI vectors are used to transfer a
gene of interest into a suitably modified expression vector
via site-specific recombination. The modified expression
vectors or host vectors used in the Univector Fusion System
are referred to as pHOST vectors. pHOST vectors are
generally expression vectors (e.g., plasmids) which have
been modified by the insertion of a sequence-specific
recombinase target site (e.g., a lox site). The presence of the
sequence-specific recombinase target site on the pHOST
plasmid permits the rapid subcloning or insertion of the gene
of interest contained within a pUNI vector to generate an
expression vector capable of expressing the gene of interest.
The pHOST vector may encode a protein domain such as an
affinity domain including, but not limited to,
glutathione-S-transferase (Gst), maltose binding protein
(MBP), a portion of staphylococcal protein A (SPA), a
polyhistidine tract, etc. A variety of commercially available
expression vectors encoding such affinity domains are known
to the art. The affinity domain may be located at either the
amino- or carboxy-terminus of the fusion protein. When the
pHOST plasmid contains a vector-encoded affinity domain, a
fusion protein comprising the vector-encoded affinity domain
and the protein of interest is generated when the pUNI and
pHOST vectors are recombined.

To generate expression vectors intended to generate
transcriptional fusions (i.e., pHOST does not contain a
vector-encoded protein domain), a sequence-specific
recombinase target site is placed after (i.e., downstream of)
the start of transcription in the host vector; this is easily
accomplished using synthetic oligonucleotides comprising the
desired sequence-specific recombinase target site. In designing
the oligonucleotide comprising the sequence-specific
recombinase target site, care is taken to avoid introducing an
ATG or start codon that might initiate translation
inappropriately.
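
The check for an unintended ATG in a target-site oligonucleotide can be sketched as a simple scan of the sense strand. This is an illustrative sketch only: the function names are hypothetical, the `LOXP` string is the commonly cited loxP sequence and is assumed here purely as an example input, and real linker design would also consider the flanking vector sequence.

```python
from typing import List

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Example input only: the commonly cited 34 bp loxP sequence (assumed).
LOXP = "ATAACTTCGTATA" + "ATGTATGC" + "TATACGAAGTTAT"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_start_codons(sense_strand: str) -> List[int]:
    """Return the 0-based position of every ATG triplet on the sense strand."""
    s = sense_strand.upper()
    return [i for i in range(len(s) - 2) if s[i:i + 3] == "ATG"]


def has_start_codon(sense_strand: str) -> bool:
    """True if the oligonucleotide would place an ATG in the transcript."""
    return bool(find_start_codons(sense_strand))
```

For this example sequence the result depends on orientation: read as written, the spacer contains ATG triplets, while the reverse complement contains none, so the orientation in which the site is inserted relative to the promoter matters for this check.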

To generate expression vectors intended to generate a
fusion protein between a vector-encoded protein domain
located at the amino-terminus of the fusion protein and the
protein of interest (encoded by the gene of interest contained
within the pUNI vector) (i.e., a translational fusion), care is
taken to place the sequence-specific recombinase target site
in the correct reading frame such that 1) an open reading
frame is maintained through the sequence-specific
recombinase target site on pHOST and 2) the reading frame in
the sequence-specific recombinase target site on pHOST is in
frame with the reading frame found on the sequence-specific
recombinase target site contained within the pUNI vector. In
addition, the oligonucleotide comprising the
sequence-specific recombinase target site on pHOST is
designed to avoid the introduction of in-frame stop codons.
The gene of interest contained within the pUNI vector is
cloned in a particular reading frame so as to facilitate the
creation of the desired fusion protein.

The modification of several expression vectors is
provided in the examples below to illustrate the creation of
suitable pHOST vectors. The general strategy involves the
generation of a linker containing the desired
sequence-specific recombinase target site (e.g., a lox site) by
annealing two complementary oligonucleotides. The annealed
oligonucleotides form a linker having sticky ends which are
compatible with ends generated by restriction enzymes
whose sites are conveniently located in the parental
expression vector (e.g., within the polylinker of the parental
expression vector).

The host cell employed will not express the trans-acting
factor required for replication of the conditional origin of
replication contained within the pUNI vector (or
alternatively the host cell will be grown at a temperature
which is non-permissive for replication of a temperature
sensitive replicon contained within the pUNI vector). The host
cells will be grown under conditions which select for the
presence of the selectable marker contained within the pUNI
vector (e.g., growth in the presence of kanamycin when the
pUNI vector contains the kanamycin resistance gene). Plasmid
or non-chromosomal DNA is isolated from host cells which
display the desired phenotype and subjected to restriction
enzyme digestion to confirm that the desired fusion event
has occurred.

e) Recombination in Procaryotic Host Cells
The fusion of a pUNI vector and a pHOST vector may be
accomplished in vivo using a host cell that expresses the
appropriate site-specific recombinase (e.g., Cre
recombinase). The host cell employed will lack the ability to
express the trans-acting factor required for replication of the
conditional origin of replication contained within the pUNI
vector (or alternatively the host cell will be grown at a
temperature which is non-permissive for replication of a
temperature sensitive replicon contained within the pUNI
vector).

The pUNI vector and the pHOST vector are cotransformed
into the host cell using a variety of methods known
to the art (e.g., transformation of cells made competent by
treatment with CaCl2, electroporation, etc.). The
cotransformed host cells are grown under conditions which
select for the presence of the selectable marker contained
within the pUNI vector.

d) In Vitro K combination 



The fusion of a pUNl vector and a pIIOST vector may be a* u ***** • ^ BMy ^ *° . transfcned ( us "« eithcranin 
wompUshcd in vitro using a purified preparation of a W ™ V, ?r n ^ ombin * Ucm reaction) into any cxprus- 



accompUshcd in vitro using a purified preparation of 
site-specific recombinase (e.g., Qc rccombinasc). inc pUNl 
vector and toe pIIOST vector are placed in reaction vessel 
(e.g., a microcentrifuge tube) in a buffer compatible with the 
site -specific rccombinasc to be used. Fur example, when a 
Cre recombinase (native or a fusion protein form) is 
employed, the reaction buffer may comprise SO mM * Iris- 
HQ (pll IS), 10 mM MgCij, 30 mM Nad and 1 mg/mi 
RSA. When a FTP recombinase is employed, the reaction 
buffer may comprise 50 mM THs-HO (pH 7.4X 10 mM 
MgClz, 100 /ig/ml DSA [GronosUjski and Sadowski (1985) 
260:12320]. The concentration of the pUNl vector and the 
pIIOST vector may vary between 100 ng to 1.0 ><g of each 
vector per 20 /d reaction volume with about 0.1 /a* of each 
nucleic acid construct (0.2 ><g mul) per 20 ^d reaction being 
prcficrrcd. lhc concentration of the site -specific rccombinasc 
may be rite red under a standard set of reactioo conditions to 
find the optimal concentration of cn/.ymc to he ufted as 
described in Ex. 4. 

Following the in vitro fusion reactioo, a portion of the 
reaction mixture is used to transform a suitable host cell to 
permit the recovery and propagation of the fused vectors. 



for the presence of the selectable marker contained within 
the pUNl vector (eg., growth in the presence of kanamycin 
when the pUNl vectnr contains the kanamycin resistance 
gene). Flasmid or non-chromosomal DNA is isolated from 
host cells which display the desired pbeootype and subjected 
to restriction enzyme digestion to confirm that the desired 
fusion event has occurred. 
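
As a rough illustration of the in vitro reaction set-up described under "In Vitro Recombination", the following Python sketch computes pipetting volumes for a single 20 µl Cre reaction from the final buffer concentrations stated in the text (50 mM Tris-HCl, 10 mM MgCl2, 30 mM NaCl, 1 mg/ml BSA). The stock concentrations, the DNA and enzyme volumes, and the function names are assumptions chosen for the example; they are not part of the disclosure.

```python
# Final concentrations of the Cre reaction buffer, as stated in the text.
FINAL = {"Tris-HCl pH 7.5 (mM)": 50.0, "MgCl2 (mM)": 10.0,
         "NaCl (mM)": 30.0, "BSA (mg/ml)": 1.0}

# Hypothetical stock concentrations (assumed for illustration only).
STOCK = {"Tris-HCl pH 7.5 (mM)": 1000.0, "MgCl2 (mM)": 1000.0,
         "NaCl (mM)": 5000.0, "BSA (mg/ml)": 10.0}


def buffer_volumes(total_ul=20.0, final=FINAL, stock=STOCK):
    """Return the µl of each stock needed for one reaction (C1*V1 = C2*V2)."""
    return {name: total_ul * final[name] / stock[name] for name in final}


def water_volume(total_ul, buffer_ul, dna_ul, enzyme_ul):
    """Return the µl of water that brings the reaction up to total_ul."""
    water = total_ul - sum(buffer_ul.values()) - dna_ul - enzyme_ul
    if water < 0:
        raise ValueError("components exceed the reaction volume")
    return water


vols = buffer_volumes()
for name, ul in vols.items():
    print(f"{name}: {ul:.2f} µl")
# DNA and enzyme volumes below are placeholders, not values from the text.
print(f"water: {water_volume(20.0, vols, dna_ul=2.0, enzyme_ul=1.0):.2f} µl")
```

The same function can be reused for the FLP buffer by passing a different `final` dictionary.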

In addition to permitting the rapid transfer of a gene of
interest from a particular pUNI vector containing a gene of
interest into a pHOST vector, the Univector Fusion System
permits the rapid exchange of an entire cDNA library to a
variety of expression vectors. The cDNA library is generated
using a pUNI vector as the cloning vector (a pUNI library).
The entire library may then be transferred (using either an in
vivo or in vitro recombination reaction) into any expression
vector modified to contain a sequence-specific recombinase
target site (e.g., a lox site) (i.e., into a pHOST vector).
This solves an existing problem in the art, in that there is no
way, using existing vector systems, to exchange the inserts
in a library made in one expression vector en masse (i.e., as
an entire library) to a different expression vector. In addition,
the sequences contained within a pUNI library can be used
to recombine with linear λ constructs (which can then be
used to isolate specific genes by complementation of
appropriate host cells such as E. coli or S. cerevisiae mutant
cells). Further, as described in Example 8, the in vivo gene
trap method, a variation of the Univector Fusion System, can
be used to transfer linear DNA fragments that lack a selectable
marker, such as a PCR product, into a variety of
expression vectors.

EXPERIMENTAL

The following examples serve to illustrate certain preferred
embodiments and aspects of the present invention and
are not to be construed as limiting the scope thereof.

In the experimental disclosure which follows, the following
abbreviations apply: °C. (degrees Centigrade); g
(gravitational field); vol (volume); DNA (deoxyribonucleic
acid); RNA (ribonucleic acid); kdal or kD (kilodaltons); OD
(optical density); EDTA (ethylene diamine tetra-acetic acid);
E. coli (Escherichia coli); SDS (sodium dodecyl sulfate);
PAGE (polyacrylamide gel electrophoresis); p (plasmid);
LB (Luria-Bertani medium: per liter 10 g Bacto-tryptone, 5
g yeast extract, 10 g NaCl, pH to 7.5 with NaOH); ml
(milliliter); µl (microliter); M (Molar); mM (milliMolar); µM
(microMolar); g (gram); µg (microgram); ng (nanogram); U
(units), mU (milliunits); min. (minutes); sec. (seconds); %
(percent); bp (base pair); kb (kilobase); PCR (polymerase
chain reaction); Tris (tris(hydroxymethyl)aminomethane);
PMSF (phenylmethylsulfonylfluoride); BSA (bovine serum
albumin); IPTG (isopropyl-β-D-thiogalactoside); ATCC
(American Type Culture Collection, Rockville, Md.); Bio-Rad
(Bio-Rad Corp., Hercules, Calif.); Invitrogen
(Invitrogen, Corp., San Diego, Calif.); New England
Nuclear/Du Pont (Boston, Mass.); Novagen (Novagen, Inc.,
Madison, Wis.); Pharmacia or Pharmacia Biotech
(Pharmacia Biotech, Piscataway, N.J.); PharMingen
(PharMingen, San Diego, Calif.); Gibco BRL (Gaithersburg,
Md.); and Stratagene (Stratagene Cloning Systems, La Jolla,
Calif.).

EXAMPLE 1

Construction Of Univector Constructs

In this example, an illustrative Univector construct is
provided, the pUNI-10 vector, which contains a loxP site, a
kanamycin resistance gene (KnR) and the R6Kγ conditional
origin of replication (OriR6Kγ). The OriR6Kγ is functional
only in E. coli strains expressing the Π replication protein
(i.e., the product of the pir gene). A gene of interest is placed
within pUNI-10 (either as a result of constructing a library
in pUNI-10 or by subcloning a previously cloned gene of
interest). Once the gene of interest is contained within
pUNI-10, any number of plasmid expression constructs
containing this gene of interest can be constructed rapidly
(e.g., within a single day). The expression constructs will
contain an antibiotic resistance gene other than kanamycin
(e.g., ampicillin). Using the site-specific recombinase, Cre,
a precise fusion between the pUNI vector and any other loxP
site-containing vector comprising the desired expression
signals adjacent to the loxP site is catalyzed. The site-specific
recombination event which occurs between the
single loxP sites located on each plasmid (e.g., pUNI and the
expression vector) results in the stable fusion of these two
plasmids in such a manner as to place the expression of the
gene of interest under the control of the expression signals
contained within the expression vector. This subcloning
event occurs without the need to use restriction enzymes.
The fusion of pUNI-10 and the expression vector is selected
for by selecting for the ability of E. coli cells that do not
express the Π protein to grow in the presence of kanamycin.
pUNI cannot replicate in E. coli cells that do not express the
Π protein unless pUNI has fused or integrated into another
plasmid that contains a normal (i.e., not a conditional) origin
of replication (e.g., the ColE1 origin); in this case, pUNI will
be replicated (as part of the fusion plasmid) and kanamycin
resistance will be conferred on the host cell.

a) Generation of pUNI-10

FIG. 2A provides a schematic map of the pUNI-10 vector;
the locations of selected restriction enzyme sites are indicated
(with the exception of NotI, all sites shown are
unique). FIG. 2B shows the DNA sequence of the loxP site
and the polylinkers contained within pUNI-10 (i.e.,
nucleotides 401-530 of SEQ ID NO:1).

Nucleotides 1-400 of pUNI-10 contain the conditional
origin of replication from R6Kγ (OriR6Kγ); the OriR6Kγ
was derived from the plasmid R6K (ATCC 37120) [Metcalf
et al. (1996) Plasmid 35:1]; nucleotides 401-414 comprise
a NotI-KpnI polylinker that facilitates the exchange of lox
sites; pUNI-10 contains a wild-type loxP site (as discussed
above, pUNI vectors containing modified lox sites may be
employed). Nucleotides 415-448 comprise the wild-type
loxP site; nucleotides 449-527 comprise a polylinker used
for the insertion of the gene of interest (genomic or cDNA
sequences). Nucleotides 528-750 contain the polyA addition
sequence from bovine growth hormone (BGH) [the BGH
polyA sequence is available on a number of commercially
available vectors including pcDNA3.1 (Invitrogen)]; the
BGH polyA sequence provides a 3' end for transcripts
expressed in mammalian and other eukaryotic cells. The art
is aware of other eukaryotic polyA sequences which may be
used in place of the BGH polyA sequence (e.g., the SV40
polyA sequence, the TK polyA sequence, etc.). Nucleotides
751-890 contain the T7 terminator sequence which is used
to terminate transcription in prokaryotic hosts (numerous
prokaryotic termination signals are known to the art and may
be employed in place of the T7 terminator sequence).
Nucleotides 890-895 comprise an EcoRV restriction
enzyme recognition site and nucleotides 896-2220 comprise
the kanamycin resistance gene (Kan or KnR) from Tn5
which provides a positive selectable marker. The KnR gene
found on pUNI-10 was modified using site-directed
mutagenesis to remove the naturally occurring NcoI site
such that pUNI-10 contains a unique NcoI site in the
polylinker region located at nucleotides 449-527. pUNI
vectors need not contain a KnR gene (modified or wild-type);
other selectable genes may be used in place of the
KnR gene (e.g., ampicillin resistance gene, tetracycline
resistance gene, zeocin™ resistance gene, etc.). The pUNI
vector need not contain a selectable marker, although the use
of a selectable marker is preferred. When a selectable
marker is present on the pUNI vector, this marker is preferably
a different selectable marker than that present on the
pHOST vector.

The nucleotide sequence of pUNI-10 is provided in SEQ
ID NO:1.
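
The feature coordinates given for pUNI-10 can be collected into a small table and checked mechanically. The Python sketch below is only a bookkeeping aid under the assumption that the coordinates are 1-based and inclusive; the feature labels and function names are chosen for the example. Run on the coordinates as stated in the text, it reports that the T7 terminator (751-890) and the EcoRV site (890-895) share nucleotide 890.

```python
# Feature coordinates of pUNI-10 as listed in the text
# (assumed here to be 1-based and inclusive).
FEATURES = [
    ("R6K gamma conditional origin", 1, 400),
    ("NotI-KpnI polylinker", 401, 414),
    ("wild-type loxP site", 415, 448),
    ("polylinker for gene of interest", 449, 527),
    ("BGH polyA sequence", 528, 750),
    ("T7 terminator", 751, 890),
    ("EcoRV site", 890, 895),
    ("kanamycin resistance gene", 896, 2220),
]


def length(start, end):
    """Length in nucleotides of an inclusive interval."""
    return end - start + 1


def check_layout(features):
    """Report gaps or overlaps between consecutive features."""
    issues = []
    for (n1, _, e1), (n2, s2, _) in zip(features, features[1:]):
        if s2 > e1 + 1:
            issues.append(f"gap between {n1} and {n2}")
        elif s2 <= e1:
            issues.append(f"overlap between {n1} and {n2}")
    return issues


for name, start, end in FEATURES:
    print(f"{name}: {start}-{end} ({length(start, end)} nt)")
print(check_layout(FEATURES))
```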

EXAMPLE 2 

Construction Of Host Plasmids For Use In The
Univector Plasmid-Fusion System

Host plasmids used in the Univector plasmid fusion
system are referred to as pHOST plasmids. pHOST plasmids
or vectors are generally expression vectors which have been
modified by the insertion of a lox site. The presence of the
lox site on the pHOST plasmid permits the rapid subcloning
or insertion of the gene of interest contained within a pUNI
vector to generate an expression vector capable of expressing
the gene of interest. The pHOST vector may encode a
protein domain such as an affinity domain including, but not
limited to, glutathione-S-transferase (Gst), maltose binding
protein (MBP), a portion of staphylococcal protein A (SPA),
a polyhistidine tract, etc. A variety of commercially available
expression vectors encoding such affinity domains are
known to the art. When the pHOST plasmid contains a
vector-encoded affinity domain, a fusion protein comprising
the vector-encoded affinity domain and the protein of interest
is generated when the pUNI and pHOST vectors are
recombined.

To generate expression vectors intended to generate
transcriptional fusions (i.e., pHOST does not contain a
vector-encoded protein domain), a lox site is placed after (i.e.,
downstream of) the start of transcription in the host vector;
this is easily accomplished using synthetic oligonucleotides
comprising the desired lox site. In designing the oligonucleotide
comprising the lox site, care is taken to avoid introducing
an ATG or start codon that might initiate translation
inappropriately.

To generate expression vectors intended to generate a
fusion protein between a vector-encoded protein domain and
the protein of interest (encoded by the gene of interest
contained within the pUNI vector), care is taken to place the
lox site in the correct reading frame such that 1) an open
reading frame is maintained through the lox site on pHOST
and 2) the reading frame in the lox site on pHOST is in frame
with the reading frame found on the lox site contained within
the pUNI vector. In addition, the oligonucleotide comprising
the lox site on pHOST is designed to avoid the introduction
of in-frame stop codons. The gene of interest contained
within the pUNI vector is cloned in a particular reading
frame so as to facilitate the creation of the desired fusion
protein.
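
The two design constraints just described (no stray ATG for transcriptional fusions, no in-frame stop codon for translational fusions) can be checked on a candidate lox sequence with a few lines of code. The Python sketch below uses the well-known 34 bp wild-type loxP sequence, which also appears within the linker oligonucleotides of this example; the function names are hypothetical and the check is a minimal sketch, not a substitute for inspecting the full linker in its vector context.

```python
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"  # wild-type loxP, 34 bp
STOPS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def open_frames(seq):
    """Return the frame offsets (0, 1, 2) that contain no stop codon."""
    frames = []
    for offset in range(3):
        codons = [seq[i:i + 3] for i in range(offset, len(seq) - 2, 3)]
        if not STOPS.intersection(codons):
            frames.append(offset)
    return frames


def has_atg(seq):
    """True if the strand as written contains a potential start codon."""
    return "ATG" in seq


print("open frames, forward strand:", open_frames(LOXP))
print("ATG in forward strand:", has_atg(LOXP))
print("ATG in reverse strand:", has_atg(reverse_complement(LOXP)))
```

For the orientation written above, only one of the three reading frames passes through loxP without a stop codon, and only the opposite strand contains an ATG, which is why both the frame and the orientation of the site matter when the linker is designed.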

The modification of several expression vectors is provided
below to illustrate the creation of suitable pHOST
vectors. In each case, the general strategy involved the
generation of a linker containing a lox site by annealing two
complementary oligonucleotides. The annealed oligonucleotides
form a linker having sticky ends which are compatible
with ends generated by restriction enzymes whose sites are
conveniently located in the parental expression vector (e.g.,
within the polylinker of the parental expression vector).

a) Modification of the pGEX-2TKcs Procaryotic Expression
Vector

pGEX-2TKcs is an expression vector active in E. coli
cells which is designed for inducible, intracellular expression
of genes or gene fragments as fusions with Gst.
pGEX-2TKcs contains the IPTG-inducible tac promoter
(Ptac) and was derived from pGEX-2TK (Pharmacia
Biotech) as follows. The polylinker sequence of pGEX-2TK,
5'-GGATCCCCGGGAATTC-3' (SEQ ID NO:2), was
replaced with the following sequence: S'-CGAT ^ 

cGCA•rA^GCccATaoercaAOGA^•ccoAAl-rc-3 , 

(SEQ ID NO:3) to generate the pGEX-2TKcs vector.

A linker containing a loxP site was generated by annealing
the following oligonucleotides:
5'-CATGGCTATAACTTCGTATAGCATACATTATACGAAGTTATG-3' (SEQ ID
NO:4) and
5'-GATCCATAACTTCGTATAATGTATGCTATACGAAGTTATAGC-3' (SEQ ID
NO:5). When annealed, these two oligonucleotides form a
double-stranded linker having a 5' end compatible with an
NcoI sticky end and a 3' end compatible with a BamHI sticky
end (FIG. 3A). pGEX-2TKcs was digested with NcoI and
BamHI (FIG. 3B) and the annealed loxP linker was inserted
to form pGst-lox.
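
The annealing step can be sanity-checked in silico. The Python sketch below tests whether two oligonucleotides form a duplex with 4-nucleotide 5' overhangs and returns those overhangs. The sequences are SEQ ID NOS:4 and 5 as read here; the first few bases of SEQ ID NO:4 are damaged in the available copy and are taken, as an assumption, to be the complement of the 3' end of SEQ ID NO:5 preceded by the NcoI overhang. The function name `anneal` is hypothetical.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def anneal(top, bottom, overhang=4):
    """Check that two oligos (both written 5'->3') anneal with 5' overhangs.

    Returns (top_overhang, bottom_overhang); raises ValueError if the
    regions beyond the overhangs are not exactly complementary.
    """
    if top[overhang:] != reverse_complement(bottom[overhang:]):
        raise ValueError("oligonucleotides are not complementary")
    return top[:overhang], bottom[:overhang]


# SEQ ID NO:4 (5' end reconstructed, see above) and SEQ ID NO:5.
TOP = "CATGGCTATAACTTCGTATAGCATACATTATACGAAGTTATG"
BOTTOM = "GATCCATAACTTCGTATAATGTATGCTATACGAAGTTATAGC"

print(anneal(TOP, BOTTOM))
```

The two overhangs returned, CATG and GATC, are the 5' overhangs left by NcoI and BamHI digestion, respectively, which is what allows the linker to be ligated into the doubly digested vector in a single orientation.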

b) Modification of the pVL1392 Baculovirus Expression
Vector

pVL1392 is an expression vector that contains the polyhedrin
promoter which is active in insect cells
(PharMingen). A linker containing a loxP site was generated
by annealing the following oligonucleotides:
5'-GGCCGGACGTCATAACTTCGTATAGCATACATTATACGAAGTTATG-3'
(SEQ ID NO:6) and
5'-GATCCATAACTTCGTATAATGTATGCTATACGAAGTTATGACGTCC-3'
(SEQ ID NO:7). When annealed, these two oligonucleotides
form a double-stranded linker having a 5' end compatible
with a NotI sticky end and a 3' end compatible with a BamHI
sticky end (FIG. 4A). pVL1392 was digested with NotI and
BamHI (FIG. 4B) and the annealed loxP linker was inserted
to form pVL1392-lox.

c) Modification of the pGAP24 Yeast Expression Vector

pGAP24 is an expression vector that is based on the yeast
2 µm circle and contains the constitutive GAP
(glyceraldehyde 3-phosphate dehydrogenase) promoter
(PGAP) which is active in yeast cells and the TRP1 gene (used
as a selectable marker when the cells are grown in medium
lacking tryptophan) [the GAP promoter is available on
pAD23; Schild (1990) Proc. Natl. Acad. Sci. USA
87:2916]. A linker containing a loxP site was generated by
annealing the following oligonucleotides:
5'-TCGAGACGTCATAACTTCGTATAGCATACATTATACGAAGTTATGC-3'
(SEQ ID NO:8) and
5'-GGCCGCATAACTTCGTATAATGTATGCTATACGAAGTTATGACGTC-3'
(SEQ ID NO:9). When annealed, these two oligonucleotides
form a double-stranded linker having a 5' end compatible
with a XhoI sticky end and a 3' end compatible with a NotI
sticky end (FIG. 5A). pGAP24 was digested with XhoI and
NotI (FIG. 5B) and the annealed loxP linker was inserted to
form pGAP24-lox.

d) Modification of the pGAL14 Yeast Expression Vector

pGAL14 is a yeast centromeric expression vector that
contains the GAL promoter (PGAL), which is induced by the
presence of galactose in the medium, and the TRP1 gene. A
linker containing a loxP site was generated by annealing
together the oligonucleotides listed in SEQ ID NOS:8 and 9.
When annealed, these two oligonucleotides form a double-stranded
linker having a 5' end compatible with a XhoI
sticky end and a 3' end compatible with a NotI sticky end
(FIG. 6A). pGAL14 was digested with XhoI and NotI (FIG.
6B) and the annealed loxP linker was inserted to form
pGAL14-lox.

EXAMPLE 3 

Expression And Purification Of A Gst-Cre Fusion 
Protein 

In order to provide a source of purified Cre recombinase
for the in vitro recombination of plasmids, the cre gene was
inserted into a Gst expression vector such that a fusion
protein comprising Gst at the amino-terminal end and Cre
recombinase at the carboxy-terminal end was produced. The
Gst-Cre fusion protein was purified by chromatography
using Glutathione Sepharose 4B (Pharmacia).

The cre gene was isolated by polymerase chain reaction
(PCR) amplification using the plasmid pRS39 (U.S. Pat. No.
4,959,317). U.S. Pat. Nos. 4,683,195, 4,683,202 and
4,965,188 cover PCR methodology and are incorporated herein by
reference. The primers used in the PCR were designed to
introduce an NcoI site at the first ATG in the cre open
reading frame. The PCR product was cloned into a TA
cloning vector (PCRH.1; Invitrogen) and then was subcloned
as an NcoI-EcoRI fragment into pGEX-2TKcs (Ex. 2)
to generate pQL123. The ligation products were used to
transform DH5α cells and the desired recombinant was
isolated and used to transform BL21(DE3) cells
(Invitrogen).

The nucleotide sequence of the Gst-Cre coding region
within pQL123 is listed in SEQ ID NO:10. The amino acid
sequence of the fusion protein expressed by pQL123 is listed
in SEQ ID NO:11.

To express the Gst-Cre fusion protein, BL21(DE3) cells
containing the pQL123 plasmid were grown at 37° C. in LB
containing 100 µg/ml ampicillin until the OD600 reached 0.6.
Expression of the fusion protein was then induced by the
addition of IPTG to a final concentration of 0.4 mM and the
cells were allowed to grow overnight at 25° C. Following
induction, the bacterial cells were pelleted by centrifugation
at 5,000×g at 4° C. and the supernatant was discarded. A cell
lysate was prepared as follows. Cells harvested from 0.5 liter
of culture were suspended in 35 ml of a solution containing
20 mM Tris-HCl, pH 8.0, 0.1M NaCl, 1 mM EDTA, 0.5%
Nonidet P-40, 5 µg/ml of each of leupeptin, antipain,
aprotinin and 1 mM PMSF at 4° C. The cells were incubated for
10 min on ice and then disrupted by sonication (3×15 sec
bursts) using a sonicator (Ultrasonic Heat Systems Model
200R) at full power. The lysate was then clarified by
centrifugation at 12,000 rpm using a SS34 rotor (Sorvall).

The Gst-Cre fusion protein was affinity purified from the
cell lysate by chromatography on Glutathione Sepharose 4B
(Pharmacia) according to the manufacturer's instructions.
The protein concentration of Gst-Cre was determined by
Bradford analysis (BioRad).

Aliquots of the cell lysate before and after chromatography
on Glutathione Sepharose 4B were applied to an SDS-PAGE
gel. Following electrophoresis, the gel was stained
with Coomassie blue. The stained gel is shown in FIG. 7. In
FIG. 7, lanes 1 and 2 contain the cell lysate before and after
chromatography, respectively. The arrowhead indicates the
Gst-Cre fusion protein. The migration of the molecular
weight protein markers is indicated to the left of lane 1. The
results shown in FIG. 7 demonstrate the purification of the
Gst-Cre fusion protein. This fusion protein was shown to be
functional (i.e., capable of mediating recombination
between lox sites) in the in vitro recombination assay
described below.

EXAMPLE 4 

In Vitro Recombination Using The Univector
Plasmid Fusion System

The Univector Plasmid Fusion System permits the in vitro
recombination of two plasmids. FIG. 8 provides a schematic
showing the strategy employed for in vitro recombination.
pA represents a generic pUNI vector which contains a loxP
site, a kanamycin resistance gene and the conditional R6Kγ
origin that is only functional in E. coli strains expressing the
Π protein (e.g., E. coli strains BW18815, BW19094,
BW20978, BW20979, BW21037, BW21038). pB represents
a generic pHOST vector which contains a loxP site, an
ampicillin resistance gene and a ColE1 origin of replication.
pAB represents the fused plasmid which results from the
Cre-mediated fusion of pA and pB.
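
The selection logic behind pA, pB and pAB can be expressed as a small model. The Python sketch below is an illustrative abstraction only: the `Plasmid` class, its fields and the helper functions are invented for the example, and the model ignores copy number, resolution of the fusion and every other biological detail. It captures one point made in the text: in a host lacking the Π protein, kanamycin resistance is maintained only if the kanamycin marker has become linked to a ColE1 origin by fusion.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    origins: frozenset   # e.g. {"R6Kgamma"} or {"ColE1"}
    markers: frozenset   # e.g. {"kan"} or {"amp"}
    lox_sites: int = 1


def fuse(a, b):
    """Model Cre-mediated fusion of two lox-bearing plasmids into one."""
    if a.lox_sites < 1 or b.lox_sites < 1:
        raise ValueError("both plasmids need a lox site")
    return Plasmid(a.name + b.name[1:], a.origins | b.origins,
                   a.markers | b.markers, a.lox_sites + b.lox_sites)


def replicates(p, host_has_pir):
    """The R6Kgamma origin needs the pir gene product; ColE1 does not."""
    return "ColE1" in p.origins or ("R6Kgamma" in p.origins and host_has_pir)


def grows_on(p, drug, host_has_pir=False):
    """True if a host carrying only plasmid p survives on the given drug."""
    return replicates(p, host_has_pir) and drug in p.markers


pA = Plasmid("pA", frozenset({"R6Kgamma"}), frozenset({"kan"}))
pB = Plasmid("pB", frozenset({"ColE1"}), frozenset({"amp"}))
pAB = fuse(pA, pB)

for p in (pA, pB, pAB):
    print(p.name, "kan:", grows_on(p, "kan"), "amp:", grows_on(p, "amp"))
```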

To illustrate the in vitro recombination reaction, pUNI-5
(a pUNI vector which differs from pUNI-10 only in that
pUNI-5 retains the NcoI site in the KnR gene and contains
a different polylinker) was employed as pA and pQL103, an
ampicillin-resistant plasmid containing a loxP site and the
ColE1 origin, was employed as pB. In a total reaction volume
of 20 µl, 0.2 µg of each of pUNI-5 (pA) and pQL103 (pB) were
mixed in a buffer containing 50 mM Tris-HCl (pH 7.5), 10
mM MgCl2, 30 mM NaCl and 1 mg/ml BSA. The amount
of purified Gst-Cre (Ex. 3) was varied from 0 to 1.0 µg. The
reactions were incubated at 37° C. for 20 minutes and then
the reactions were placed at 70° C. for 5 min. to inactivate
the Gst-Cre protein. Five microliters of each reaction mixture
were used directly to transform competent DH5α cells
(CaCl2 treated). The transformed cells were plated onto
LB/Amp (100 µg/ml amp) and LB/Kan (40 µg/ml kan)
plates and the number of ampicillin-resistant (ApR) and
kanamycin-resistant (KnR) colonies were counted. The
results are summarized in Table 1.

TABLE 1

[Column headings and some entries are illegible in the source;
the legible values are listed in reading order.]

1.9 × 10^[?]
1.1 × 10^[?]    682    6.2
0.1    1.5 × 10^[?]    502    3.3
0.5    0.3 × 10^[?]    104    3.4
1.0    [?] × 10^[?]    52     1.7




The results shown in Table 1 demonstrate that under
these reaction conditions 0.05 µg purified Gst-Cre per 20 µl
reaction yields the most efficient rate of plasmid fusion.
Plasmid DNA was isolated from individual kanamycin-resistant
colonies (using standard mini-prep plasmid DNA
isolation protocols) and subjected to restriction enzyme
digestion to determine the structure of the fused plasmids.
This analysis revealed that plasmid DNA isolated from the
kanamycin-resistant colonies represented a dimer created by
the desired fusion of pUNI-5 and pQL103 via the loxP sites.
These results demonstrate that the Univector Plasmid Fusion
System can be used to rapidly fuse two plasmids together
in vitro.
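
One simple way to compare recombinase doses in such a titration is the ratio of kanamycin-resistant to ampicillin-resistant colonies, since only fused plasmids confer kanamycin resistance in this host. The Python sketch below shows that calculation. The colony counts in `example` are hypothetical placeholders and are not the values of Table 1; the function names are likewise assumptions made for the illustration.

```python
def fusion_efficiency(kan_colonies, amp_colonies):
    """Fraction of ApR transformants that are also KnR (i.e., fused)."""
    if amp_colonies <= 0:
        raise ValueError("need at least one ampicillin-resistant colony")
    return kan_colonies / amp_colonies


def best_dose(results):
    """results maps µg of Gst-Cre to (KnR colonies, ApR colonies)."""
    return max(results, key=lambda dose: fusion_efficiency(*results[dose]))


# Hypothetical counts for illustration only, NOT the values of Table 1.
example = {0.0: (0, 20000), 0.05: (600, 10000), 0.5: (100, 3000)}

for dose, (kan, amp) in example.items():
    print(f"{dose} µg: {100 * fusion_efficiency(kan, amp):.1f}% KnR/ApR")
print("most efficient dose:", best_dose(example), "µg")
```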

EXAMPLE 5

In Vitro Fusion Between A pUNI Vector
Containing A Gene Of Interest And A Lox-Containing
Expression Vector Produces A Fused
Vector Capable Of Expressing The Gene Of
Interest

In Example 4 it was demonstrated that the Univector
Plasmid Fusion System can be used to rapidly fuse two
plasmid constructs together in vitro. In this example, the
ability of the Univector Plasmid Fusion System to fuse two
plasmids together in a manner that places the gene of interest
contained on the pUNI vector under the transcriptional
control of a promoter contained on the pHOST or expression
vector, such that a functional protein of interest
is expressed from the fused construct, is demonstrated.

a) Insertion Of A Gene Of Interest Into The pUNI-10
Vector

The cDNA encoding the wild-type yeast Skp1 protein [Bai
et al. (1996) Cell 86:263] was cloned into the pUNI-10
vector between the NdeI and BamHI sites to generate
pUNI-Skp1; the yeast SKP1 cDNA sequence is available as
GenBank accession no. U61764. Skp1 is an essential protein
involved in the regulation of the cell cycle in yeast. Yeast
cells containing a temperature sensitive mutant of Skp1
cannot grow at the non-permissive temperature (37° C.).

b) In Vitro Fusion Reactions And Complementation
Assays

pUNI-Skp1 was recombined with pGAP24-lox (Ex. 2) and
pGAL14-lox (Ex. 2) using the in vitro reaction described in
Ex. 4; 0.2 µg of Gst-Cre was used per 20 µl reaction. The
resulting plasmid fusions were termed pGAP24-Skp1 and
pGAL14-Skp1. pGAP24-Skp1 and pGAL14-Skp1 were then
transformed into the temperature sensitive (ts) skp1-11
mutant yeast strain Y555 (Bai et al., supra) and the
transformed yeast cells were plated onto SC-tryptophan plates (to
select for the expression of the selectable marker TRP1) and
incubated at either a permissive (25° C.) or non-permissive
temperature (37° C.); the plates which received yeast cells
transformed with pGAL14-Skp1 contained galactose. The
ability of the transformed cells to grow at the non-permissive
temperature is dependent upon the expression of
the wild-type SKP1 gene encoded by a properly fused
pUNI-Skp1/expression vector construct. As a control, the yeast
SKP1 genomic clone contained in a URA CEN vector
(produced by conventional cloning techniques) was used to
transform the ts skp1-11 mutant yeast strain Y555 and the
transformed cells were also plated at 25° C. and 37° C. In
each case, an expression vector (e.g., pRS414 or pRS415;
Bai et al., supra) lacking the SKP1 gene but containing the
same selectable marker (i.e., TRP1) as either pGAP24-Skp1,
pGAL14-Skp1 or URA CEN-Skp1 was used to transform
Y555 cells as a control capable of permitting the growth of
transformed Y555 cells on selective medium at the permissive
temperature.

The results demonstrated that the URA CEN-SKP1 construct
produced by conventional cloning techniques produced
a functional Skp1 protein which was capable of
complementing the lethality of the skp1-11 ts mutation.
More importantly, the results demonstrated that the in vitro
fusion reaction that created pGAP24-Skp1 and pGAL14-Skp1
produced constructs capable of producing functional
Skp1; that is, Y555 cells transformed with either pGAP24-Skp1
or pGAL14-Skp1 were capable of growth at 37° C., a
temperature at which the ts Skp1-11 protein produced by the
host strain is non-functional. Expression vectors lacking the
SKP1 cDNA were incapable of complementing the lethality
of the skp1-11 ts mutation.
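
The expected outcome of the complementation assay can be summarized as a small truth table. The Python sketch below encodes the reasoning of this example under simplifying assumptions: growth is treated as a yes/no outcome, any temperature at or above 37° C. is treated as non-permissive, and the function and parameter names are invented for the illustration.

```python
PERMISSIVE_C = 25
NON_PERMISSIVE_C = 37


def grows(temp_c, has_selectable_plasmid, expresses_wt_skp1):
    """Predicted growth of ts skp1-11 cells on selective medium."""
    if not has_selectable_plasmid:
        return False                   # no marker, no growth on selection
    if temp_c >= NON_PERMISSIVE_C:
        return expresses_wt_skp1       # ts allele inactive; needs wild-type Skp1
    return True                        # permissive temperature


cases = {
    "fused vector carrying SKP1": True,
    "vector lacking SKP1": False,
}
for label, has_skp1 in cases.items():
    for temp in (PERMISSIVE_C, NON_PERMISSIVE_C):
        print(f"{label}, {temp} C: growth = {grows(temp, True, has_skp1)}")
```

Growth at 37° C. is therefore the informative readout, while growth at 25° C. only confirms that the cells were transformed.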

c) Restriction Analysis, SDS-PAGE Analysis and Western
Blot Analysis of In Vitro Fusion Reactions

pUNI-Skp1 was recombined with pGst-lox (Ex. 2) using
the in vitro reaction described in Ex. 4; 0.2 µg of Gst-Cre
was used per 20 µl reaction. The resulting plasmid fusion
was termed pGST-Skp1. FIG. 9A provides a schematic
showing the starting constructs and the predicted fusion
construct. Five microliters of the fusion reaction mixture
was used to transform DH5α cells as described in Ex. 4. The
transformed cells were plated onto LB/Amp/Kan plates and
plasmid DNA was isolated from individual ApRKnR colonies.
The plasmid DNAs were digested with PstI followed
by electrophoresis on agarose gels to examine the structure
of the fused plasmids. A representative ethidium bromide-stained
gel is shown in FIG. 9B. In FIG. 9B, lane "M"
contains DNA size markers, lanes pUNI-Skp1 and pGst-lox
contain the starting plasmids digested with PstI and lanes
1-12 contain plasmid DNA from individual ApRKnR colonies
digested with PstI. Lanes marked with an [*] indicate
that these colonies contained a trimeric fusion plasmid that
resulted from the fusion of two Gst-lox plasmids and one
pUNI-Skp1 plasmid. The sizes of the two PstI fragments
which result from the fusion of pUNI-Skp1 and pGst-lox in
kb are indicated (5.8 and 2.0 kb). The results shown in FIG.
9B demonstrate that the in vitro fusion reaction resulted in
the production of the desired fused construct with high
efficiency (about 83% of the plasmids in the ApRKnR
colonies comprised the fusion of one pUNI-Skp1 vector with
one pGst-lox vector).

Three individual ApRKnR colonies were picked and
grown in liquid cultures which were induced with IPTG to
examine whether the fused construct (pGst-Skp1) could
produce the desired Gst-Skp1 fusion protein. The cultures
were grown and induced, and cell extracts were prepared, as
described in Ex. 3. Aliquots of the cell lysates prepared
from induced and uninduced cells were electrophoresed on
an SDS-PAGE gel and the gel was either stained with
Coomassie blue or transferred to nitrocellulose to generate a
Western blot. The Western blot was probed using an anti-Skp1
polyclonal antibody (the antibody was raised against
the yeast Skp1 using conventional methods). The resulting
Coomassie-stained gel and Western blot are shown in FIGS.
10A and 10B, respectively.

In FIG. 10A, lane "M" contains protein molecular weight
markers (size in kD is indicated). Lanes marked "C" contain
extracts prepared from E. coli containing a GST-SKP1
construct made by conventional cloning [i.e., the SKP1
cDNA was excised using restriction enzymes and inserted
into pGEX-2TKcs (Ex. 2)]. Lanes 1-3 contain extracts from
ApRKnR cells transformed with in vitro fusion reaction
mixtures. Extracts prepared from uninduced cells and IPTG-induced
cells are indicated by "-" and "+", respectively. The
arrowheads indicate the location of the Gst-Skp1 fusion
proteins; the Gst-Skp1 fusion product generated from the
pGST-SKP1 fusion construct contains 15 additional amino
acids which are located between the Gst domain and the
Skp1 protein sequences relative to the Gst-Skp1 fusion protein
expressed from the conventionally constructed GST-SKP1
plasmid (the additional 15 amino acids are encoded by
the linker comprising the loxP site; see FIG. 3). In FIG. 10B,
the lane designations are the same as described for FIG.
10A. This Western blot confirms that the bands indicated by
the arrowheads in FIG. 10A represent Gst-Skp1 fusion
proteins.

The results shown in FIGS. 10A and 10B demonstrate that
the Univector Fusion System can be used to create an
expression vector which maintains the proper translational
reading frame and permits the expression of a fusion protein
comprising the expression vector-encoded affinity tag and
the protein of interest.

The above results demonstrate that the Univector Fusion
System can be used to recombine two plasmids, one containing
a gene of interest but no promoter (this vector may
optionally contain expression signals such as termination
signals and/or polyadenylation signals) and the other containing
a promoter and optionally other expression signals
(e.g., splicing signals, translation initiation codons) (and
optionally sequences encoding an affinity domain) but lacking
a gene of interest, in vitro in such a manner that the
proper translational reading frame is maintained, permitting
the expression of a functional protein from the fused plasmids
in the host cell.

EXAMPLE 6

Construction Of An E. coli Strain That Inducibly
Expresses Cre Recombinase

An E. coli strain containing a cre gene under the control
of an inducible promoter, termed the QLB4 strain, was
constructed as follows. The cre gene was placed under the
transcriptional control of the inducible lac promoter by
inserting the cre ORF into a derivative of pNN402 [Elledge
et al. (1991) Proc. Natl. Acad. Sci. USA 88:1731]; pNN402
was modified to contain a lac promoter. This construct was
then crossed onto lambda phage (e.g., λgt11) using conventional
techniques. The recombinant lambda phage carrying
the lac-cre gene was integrated into the chromosome of E.
coli strain JM107 to generate the QLB4 strain.

Expression of Cre recombinase was induced by growing
QLB4 cells at 37° C. until an OD600 of 0.6 was reached. The
culture was then split into 2 parts and IPTG was added to one
part to a final concentration of 0.4 mM. As a control, the
BNN132 strain (ATCC 47059; Elledge et al. (1991), supra)
which contains the cre gene under the transcriptional control
of the endogenous cre promoter was treated as described for
the QLB4 strain. Cell extracts (total protein) were prepared
from all four samples (QLB4±IPTG and BNN132±IPTG)
and examined for expression of Cre recombinase by Western
blotting analysis. The Western blot was probed using a rabbit
polyclonal anti-Cre antibody (Novagen) as the primary
antibody and a goat anti-rabbit IgG horseradish peroxidase
conjugate (Amersham) as the secondary antibody according
to the manufacturer's instructions. FIG. 11 shows a
Western blot containing extracts prepared from (shown left
to right) BNN132 cells grown in the absence of IPTG ("C")
and QLB4 cells grown in the absence ("QLB4-") and
presence of IPTG ("QLB4+"), respectively. The location of
the Cre recombinase band is indicated by the arrowhead.
The additional bands seen on this Western blot are due to
cross-reactivity of the crude (i.e., not affinity purified) rabbit
anti-Cre antibody with bacterial proteins.

Western blot analysis demonstrated that Cre protein could
not be detected in BNN132 cells grown in the presence or
absence of IPTG. Cre protein was detected in QLB4 cells
grown in the presence of IPTG, but not in the absence of
IPTG, by Western blot analysis. Therefore, the expression of
Cre recombinase in QLB4 cells is greatly induced by the
presence of IPTG in the growth medium. By this analysis,
the expression of Cre recombinase in QLB4 cells is dependent
upon the induction of the lac-cre gene by IPTG.
However, more sensitive functional assays indicate that the
Cre protein is expressed constitutively at very low levels in
both BNN132 cells and QLB4 cells in the absence of IPTG.
In these functional assays, a pUNI vector (Kn^R) and a
pHOST vector (Ap^R) were cotransformed into QLB4 cells
and the transformed cells were grown on plates containing
kanamycin to select for the presence of the pUNI-pHOST
fusion plasmid. Plasmid DNA was isolated from individual
kanamycin-resistant colonies and subjected to restriction
enzyme digestion to examine the structure of the plasmid
DNA. This analysis revealed that multiple isoforms of the
plasmid fusion product were present in the plasmid DNA
isolated from any single kanamycin-resistant colony. While
not limiting the present invention to any particular
mechanism, it is believed that low level constitutive expression
of Cre recombinase leads to multiple fusion events
between the pUNI and pHOST vectors, resulting in the
production of multimeric forms (i.e., trimer, tetramer, etc.) of
the fused plasmid (the desired fused plasmid is a dimer
formed by fusion of pUNI and pHOST). The multimeric
plasmid fusion products would be expected to be unstable
due to the fact that the Cre protein is constitutively expressed
in QLB4 cells.

To overcome the potential problems that low level constitutive
expression of the cre gene in the host cell may
cause, the expression of cre can be more tightly controlled
as described below. In addition to the approaches described
below, the pUNI and pHOST vectors can be modified as
described in Example 7 and these modified vectors can be
fused using a host cell that constitutively expresses the Cre
protein.

The expression of Cre recombinase can be more tightly
controlled by a variety of means. For example, when cre is
expressed under the control of the lac promoter, its expression
can be made conditional by growing the
host cells in medium containing glucose. The presence of
0.2% glucose in the growth medium virtually shuts down
transcription from the lac promoter. In addition, the lac
promoter can be modified to insert additional operator (o)
sites which bind the lac repressor. Other tightly controlled
promoters are known to the art [e.g., the T7 promoter which
requires the expression of T7 RNA polymerase; these promoters
are available on the pET vectors (Novagen)] and may
be employed to control the expression of the cre gene.

In addition to placing the cre ORF under the control of a
tightly controlled promoter, Cre expression can be tightly
controlled by placing the cre gene on a plasmid containing
a temperature-sensitive (ts) replicon (e.g., rep pSC101^ts).
When the cre gene is carried on a ts replication plasmid, Cre
will be expressed during the transformation of the host cell
(because the host cell containing the ts plasmid containing
the cre gene was maintained at the permissive temperature)
but will be absent following recombination of the pUNI and
pHOST vectors when the host cell is grown at a temperature
non-permissive for replication of the ts replicon.

EXAMPLE 7

In Vivo Recombination In Procaryotic Hosts Using
The Univector Fusion System

As described in Example 6, cotransformation of E. coli
cells expressing Cre protein (e.g., QLB4, BNN132) with a
pUNI construct and a pHOST construct (each construct
containing a single lox site) results in the fusion of these two
constructs in vivo. If the host cell used for the recombination
reaction constitutively expresses the Cre protein, multimeric
forms of the fused constructs are generated. In addition to
the methods outlined above for tightly regulating the expression
of the cre gene in the host cell, cells constitutively
producing Cre protein can be employed with modified pUNI
and pHOST vectors as described in this example. The pUNI
construct is modified such that two different lox sites flank
the kanamycin resistance gene (the modified pUNI construct
is termed pUNI-D). The two lox sites differ in their spacer
regions by one or two nucleotides and for the sake of
discussion the two different lox sites are referred to as
"loxA" and "loxB" (e.g., loxP and loxP511; "loxB" is used
in this discussion to distinguish it from the first lox site
termed "loxA" and does not indicate the use of the loxB
sequence found in the E. coli chromosome). Cre cannot
efficiently catalyze a recombination event between a loxA
site and a loxB site due to the sequence changes located in the
spacer regions between the Cre binding sites; however, Cre
can efficiently catalyze the recombination between two loxA
sites or two loxB sites [Hoess et al. (1986) Nucleic Acids
Res. 14:2287]. The pHOST construct is modified such that
one loxA site and one loxB site flank the selectable marker
gene (the modified pHOST construct is termed pHOST-D).
In this example, pHOST contains the sacB gene as the
selectable marker (a negative selectable marker). The presence
of the sacB gene on pHOST-D provides a means of
counter-selection as cells expressing the sacB gene are killed
when the cell is grown in medium containing 5% sucrose
[Gay et al. (1985) J. Bacteriol. 164:918 and (1983) J.
Bacteriol. 153:1424].
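
The loxA/loxB distinction described above can be sketched in code. The following is a minimal illustration, not the patent's own procedure: it treats a lox site as a 34-nt string (13-nt repeat, 8-nt spacer, 13-nt repeat) and applies the simplified rule that Cre recombines two sites efficiently only when their spacers are identical. The loxP sequence is the well-known wild-type site; the loxP511 sequence is the variant commonly reported in the literature and is an assumption here, since SEQ ID NO:16 is not reproduced in this section. The function names are hypothetical.

```python
ARM_LEN = 13     # each inverted repeat (Cre binding site)
SPACER_LEN = 8   # spacer region between the two repeats

# Wild-type loxP, written as left repeat + spacer + right repeat.
LOXP = "ATAACTTCGTATA" + "GCATACAT" + "TATACGAAGTTAT"
# loxP511 as commonly reported (assumption): one spacer substitution.
LOXP511 = "ATAACTTCGTATA" + "GTATACAT" + "TATACGAAGTTAT"


def spacer(site: str) -> str:
    """Return the 8-nt spacer of a 34-nt lox site."""
    if len(site) != 2 * ARM_LEN + SPACER_LEN:
        raise ValueError("expected a 34-nt lox site")
    return site[ARM_LEN:ARM_LEN + SPACER_LEN]


def spacer_differences(site_a: str, site_b: str) -> int:
    """Count positions at which the two spacers differ."""
    return sum(x != y for x, y in zip(spacer(site_a), spacer(site_b)))


def recombines_efficiently(site_a: str, site_b: str) -> bool:
    """Simplified rule from the text: identical spacers are required."""
    return spacer_differences(site_a, site_b) == 0


if __name__ == "__main__":
    print(spacer_differences(LOXP, LOXP511))
    print(recombines_efficiently(LOXP, LOXP))
    print(recombines_efficiently(LOXP, LOXP511))
```

The all-or-nothing rule is a deliberate simplification; the text says only that recombination between a loxA site and a loxB site is inefficient, not that it is impossible.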

FIG. 12 provides a schematic showing the strategy for in
vivo recombination in a Cre-expressing host cell (e.g.,
QLB4 cells) using the pUNI-D and pHOST-D constructs.
Arrows are used to indicate the direction of transcription of
various genes or gene segments in FIG. 12. In FIG. 12, the
following abbreviations are used: Ap^R (ampicillin resistance
gene); Kn^R (kanamycin resistance gene); Ori (non-conditional
plasmid origin of replication); Ori^R (the R6Kγ
conditional origin of replication); Cre (Cre recombinase);
GENEX (gene of interest). The strategy outlined in FIG. 12
is referred to as the "in vivo gene-trap". FIG. 12 illustrates
that the second lox site (loxB) in pUNI-D (relative to the
design of the pUNI-10 vector) is inserted between the
kanamycin resistance gene and the R6Kγ conditional origin
of replication.

To generate a pHOST-D construct, a commercially available
expression vector containing the desired promoter (and
optionally enhancer) is modified as described in Ex. 2 to
insert the loxA site downstream of the promoter (it is not
necessary that a commercially available expression vector
be employed as the art is well aware of methods for the
generation of expression vectors). Sequences encoding the
sacB gene [Gay et al. (1983) J. Bacteriol. 153:1424; GenBank
accession nos. X02730 and K01987] and the second
lox site (loxB) are inserted downstream of the first lox site
(loxA).

The pUNI-D and pHOST-D constructs are cotransformed
into QLB4 cells (Ex. 6) and the transformed cells are plated
onto LB/Ap/Kn plates containing 5% sucrose to select for
the desired recombinant. FIG. 12 illustrates the recombination
events which will occur in the presence of Cre in the
QLB4 cells. First, pUNI-D and pHOST-D will fuse to form
two dimers in which two possible double cross-over events
can occur. These two double cross-over events are diagrammed
in FIG. 12. The double cross-over events will
result in the exchange of the DNA segments that are flanked
by loxA and loxB to produce the plasmids labelled "A" and
"B." All plasmids which contain the sacB gene (the pHOST-D,
the fused plasmids and plasmid B) will be selected
against by the presence of sucrose in the growth medium.
The pUNI-D construct will not be able to replicate in QLB4
cells as these cells do not express the π protein required for
replication of the R6Kγ origin. Therefore, the only construct
that will be maintained in QLB4 cells selected on LB/Kn
containing sucrose is the desired plasmid A in which the gene of
interest from pUNI-D has been placed under the transcriptional
control of the promoter located on pHOST-D.
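
The selection logic of the in vivo gene trap can be summarized as a small truth table. The sketch below is illustrative only: the `Plasmid` record and the `survives` function are hypothetical names, and the marker content assigned to plasmid B (sacB on the R6Kγ backbone, no resistance gene) is inferred from the description of FIG. 12 rather than stated explicitly. A plasmid is retained only if it can replicate in the host, confers resistance to every antibiotic on the plate, and does not carry sacB when sucrose is present.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    markers: frozenset            # resistance genes carried, e.g. {"Ap", "Kn"}
    has_sacB: bool                # counter-selectable marker present?
    has_nonconditional_ori: bool  # replicates without the pi protein?


def survives(p, antibiotics=frozenset({"Ap", "Kn"}),
             sucrose=True, host_has_pi=False):
    """Apply the three selection criteria described in the text."""
    replicates = p.has_nonconditional_ori or host_has_pi
    resistant = antibiotics <= p.markers
    killed_by_sucrose = sucrose and p.has_sacB
    return replicates and resistant and not killed_by_sucrose


CANDIDATES = [
    Plasmid("pUNI-D", frozenset({"Kn"}), False, False),
    Plasmid("pHOST-D", frozenset({"Ap"}), True, True),
    Plasmid("pUNI-D/pHOST-D fusion", frozenset({"Ap", "Kn"}), True, True),
    Plasmid("plasmid A", frozenset({"Ap", "Kn"}), False, True),
    Plasmid("plasmid B", frozenset(), True, False),  # inferred composition
]

if __name__ == "__main__":
    # Host modelled as QLB4: Cre present, no pi protein.
    print([p.name for p in CANDIDATES if survives(p)])
```

Under these assumptions only plasmid A passes all three criteria, which matches the outcome described above.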

pUNI-10 was modified to place a second lox site,
comprising the loxP511 sequence (SEQ ID NO:16), between the
kanamycin resistance gene and the R6Kγ conditional origin
of replication to create pUNI-10-D. A second lox site,
comprising the loxP511 site, was inserted onto a loxP-containing
expression plasmid (i.e., a pHOST vector) to
create a pHOST-D vector. One-half of one microgram of
each plasmid was cotransformed into competent QLB4 cells,
an aliquot of the transformed cells was plated onto
LB/Ap plates and onto LB/Ap/Kn plates containing 5%
sucrose, and the number of colonies on each type of plate
was counted. The percentage of Ap^R Kn^R colonies which
grew on sucrose-containing plates relative to the number of
Ap^R colonies was 1% (1×10^[illegible]/1×10^[illegible]). Restriction enzyme
digestion of plasmid DNA isolated from individual
Ap^R Kn^R colonies which grew on sucrose-containing
plates confirmed that the desired fusions had been generated.
These results indicate that the in vivo gene trap method can
be used to recombine a gene of interest carried on a pUNI-D
vector into an expression vector using host cells that
constitutively express the Cre protein.

In addition to providing a means for recombining a gene
of interest carried on a pUNI-D vector into an expression
vector using host cells that constitutively express the Cre
protein, the in vivo gene trap method provides a means to
transfer a gene of interest contained on a linear DNA
molecule (e.g., a PCR product) that lacks a selectable marker
into an expression vector(s). The desired PCR product is
amplified using two primers, each of which encodes a different
lox site (a "loxA" and "loxB" site such as a loxP and
loxP511 site). A pUNI vector is constructed that contains (5'
to 3') a loxA site, a counter-selectable marker such as the
sacB gene and a loxB site (i.e., the two different lox sites
flank the counter-selectable marker). This pUNI vector also
contains a conditional origin of replication and an antibiotic
resistance gene as described above and in Ex. 1. The PCR
product (loxA-amplified sequence-loxB) is recombined with
the modified pUNI vector (which comprises loxA-counter-selectable
marker-loxB) to create a pUNI vector containing
the PCR product which now lacks the counter-selectable
marker; this recombination event is selected for by growing
the host cells in medium which kills the host if the counter-selectable
gene is expressed. The PCR product in the pUNI
vector (containing 2 lox sites) can then be placed under the
control of the desired promoter element by recombining the
pUNI/PCR product construct with the appropriate pHOST-D
vector.
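
The primer design implied by the preceding paragraph can be sketched as follows. This is a hypothetical illustration, not a protocol from the specification: the forward primer carries the loxA site followed by the 5' end of the target, and the reverse primer is the reverse complement of the 3' end of the target followed by the loxB site. The annealing length of 18 nt, the function names, and the loxP511 sequence (taken from the general literature) are assumptions, and the sketch does not model the orientation of each lox site relative to the sites on the pUNI vector, which would have to match in practice.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

LOX_A = "ATAACTTCGTATA" + "GCATACAT" + "TATACGAAGTTAT"  # loxP
LOX_B = "ATAACTTCGTATA" + "GTATACAT" + "TATACGAAGTTAT"  # loxP511 (assumed)


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_primers(target: str, anneal_len: int = 18):
    """Return (forward, reverse) primers that add loxA and loxB tails."""
    if len(target) < anneal_len:
        raise ValueError("target shorter than the annealing length")
    forward = LOX_A + target[:anneal_len]
    reverse = reverse_complement(target[-anneal_len:] + LOX_B)
    return forward, reverse


def expected_product(target: str) -> str:
    """Top strand of the amplified product: loxA-target-loxB."""
    return LOX_A + target + LOX_B


if __name__ == "__main__":
    demo_target = "ATGGCTAGCAAAGGAGAAGAACTTTTCACTGGAGTTGTCTAA"  # made-up sequence
    fwd, rev = design_primers(demo_target)
    print(fwd)
    print(rev)
```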

EXAMPLE 8 

The Use Of Modified LoxP Sites To Increase
Expression Of The Protein Of Interest

The pUNI and pHOST constructs employed in the
Univector Plasmid Fusion System were designed such that
plasmid fusion results in the introduction of a lox site
between the promoter and the gene of interest. LoxP sites
consist of two 13 bp inverted repeats separated by an 8 bp
spacer region [Hoess et al. (1982) Proc. Natl. Acad. Sci.
USA 79:3398 and U.S. Pat. No. 4,959,317]. Transcripts of
the gene of interest produced from a pUNI-pHOST fusion
construct comprising a loxP site may have two 13 nucleotide
perfect inverted repeats within the 5' untranslated region
(UTR) which have the potential to form a stem-loop structure
(this will occur in those cases where pHOST does not
encode an affinity domain at the amino-terminus of the
fusion protein). It is currently believed that the ribosome
scanning mechanism is the most commonly used mechanism
for initiation of translation in eukaryotes (e.g., yeast
and mammalian cells). Using this mechanism, the ribosome
binds to the 5' cap structure of the mRNA transcript and
scans downstream along the 5' UTR searching for the first
AUG or translation start codon. Without limiting the present
invention to any particular mechanism, it is possible that a
stem-loop structure formed by the presence of a loxP
sequence on the 5' UTR of the mRNA encoding the protein
of interest would block or reduce the efficiency of ribosome
scanning and thus the translation initiation step could be
impaired. There is evidence that stem-loop structures in the
5' UTR of particular mRNAs reduce the efficiency of
translation in eukaryotes [see, e.g., Donahue et al. (1988)
Mol. Cell. Biol. 8:2964 and Yoon et al. Genes and Dev.
(1992) 6:2463]. It is noted that no evidence suggests that the
presence of a stem-loop structure in the coding region (as
opposed to the 5' UTR) of a transcript negatively affects its
ability to be translated. It is likely that the energy of protein
synthesis is sufficient to overcome secondary structures
present in mRNAs. Indeed, the data presented in Ex. 5 show
that a GST-SKP1 fusion construct produced using the
Univector Fusion System (i.e., the construct contains a loxP
site between the sequences encoding the Gst and Skp1
domains) produced the same level of fusion protein as did a
conventional construct encoding a Gst-Skp1 fusion protein
which lacks the loxP sequence. Therefore, concerns over the
presence of a stem-loop structure caused by the presence of
a lox sequence in a transcript encoded by a pUNI-pHOST
fusion construct are limited to those constructs which do not
generate fusion proteins.
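
The stem-loop potential discussed above follows from the fact that the two 13 nt repeats of loxP are perfect reverse complements of one another. A minimal sketch of that check is given below, assuming the well-known wild-type loxP sequence; the function names are hypothetical, and positions are numbered 1 to 13 from the outer (5') end of the left repeat, which is an assumption about the numbering convention rather than something this section states.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

ARM_LEN = 13     # each inverted repeat
SPACER_LEN = 8   # spacer region between the repeats

WILD_TYPE_LOXP = "ATAACTTCGTATA" + "GCATACAT" + "TATACGAAGTTAT"


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def split_lox(site: str) -> tuple[str, str, str]:
    """Split a 34-nt lox site into (left repeat, spacer, right repeat)."""
    if len(site) != 2 * ARM_LEN + SPACER_LEN:
        raise ValueError("expected a 34-nt lox site")
    return (site[:ARM_LEN],
            site[ARM_LEN:ARM_LEN + SPACER_LEN],
            site[ARM_LEN + SPACER_LEN:])


def arm_mismatches(site: str) -> list[int]:
    """1-based positions where the two repeats would fail to base-pair."""
    left, _, right = split_lox(site)
    partner = reverse_complement(right)  # what a perfectly paired left repeat looks like
    return [i + 1 for i, (a, b) in enumerate(zip(left, partner)) if a != b]


if __name__ == "__main__":
    print(arm_mismatches(WILD_TYPE_LOXP))  # no mismatches: perfect 13-nt stem
```

An empty list means the transcript could form an uninterrupted 13 bp stem with an 8 nt loop; each listed position would interrupt that stem.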

If low levels of expression are observed when a gene of
interest is expressed from a pUNI-pHOST fusion construct
comprising lox sequences that comprise perfect 13 bp
inverted repeats (e.g., loxP), pUNI and pHOST constructs
containing mutated loxP sequences are employed. The
mutated loxP sequences comprise point mutations that create
mismatches between the two 13 bp inverted repeat
sequences within the loxP site which disrupt the formation
of or reduce the stability of a stem-loop structure.
Specifically, two modified loxP sites were designed that
have mismatches at different positions in the inverted
repeats located within a loxP site. The 13 bp inverted repeats
are binding sites for the Cre protein; thus, each loxP site has
two binding sites for Cre. For the purpose of discussion,
these two binding sites are referred to as L and R (left and
right). The wild-type loxP site is designated L(0)-R(0)
wherein "0" indicates the absence of a mutation (i.e., the
wild-type sequence). Two derivatives of the wild-type loxP
sequence were designed and termed loxP2 and loxP3. The
sequences of loxP2 (SEQ ID NO:13) and loxP3 (SEQ ID
NO:14), as well as the wild-type loxP sequence (SEQ ID
NO:12), are shown in FIG. 13. LoxP2 is placed on the
pUNI-10 construct (in place of the wild-type loxP site) and
loxP3 is placed on the pHOST construct.

LoxP2 has repeats designated L(3,6)-R(0) which indicates
that the third and sixth nucleotides of the left repeat are
mutated; thus, a mismatch is introduced at the third and sixth
positions between the L and R repeats of the loxP2 site.
LoxP3 has repeats designated L(0)-R(9) which indicates that
the ninth nucleotide on the right repeat sequence is mutated
to introduce a mismatch at the ninth position between the L
and R repeats of the loxP3 site. Fusion between the loxP2
site on the pUNI construct and the loxP3 site on the pHOST
construct will generate a hybrid loxP23 site [L(3,6)-R(9)]
located between the promoter and the gene of interest and a
wild-type loxP site [L(0)-R(0)] at the distal junction. Thus,
the loxP23 site (SEQ ID NO:15) in the 5' UTR will have
three mismatches distributed at positions 3, 6 and 9 between
the 13 nucleotide inverted repeats which are expected to
strongly destabilize the formation of the stem-loop structure.
Other mutated loxP sequences suitable for disruption of the
stem-loop structure will be apparent to those skilled in the
art; therefore, the present invention is not limited to the use
of the loxP2 and loxP3 sequences for the purpose of disrupting
stem-loop formation on the 5' UTR of transcripts
produced from pUNI-pHOST fusion constructs. The suitability
of any pair of mutated lox sites for use in the
Univector Fusion System may be tested by placing one
member of the pair on a pUNI vector and the other member
on a pHOST construct. The two modified vectors are then
recombined in vitro as described in Ex. 4 and the fusion
reaction mixture is used to transform E. coli cells and the
transformed cells are plated on selective medium (e.g., on
LB/Amp and LB/Kan plates) in order to determine the
efficiency of recombination between the two mutated lox
sites (Ex. 4). The efficiency of recombination between the
two mutated lox sites is compared to the efficiency of
recombination between two wild-type lox sites. Any pair of
two different mutant lox sites that recombines at a rate that
is about 5% or greater of that observed using two loxP
sites is a useful pair of mutated lox sites for use in avoiding
the formation of a stem-loop structure on the 5' UTR of the
mRNA transcribed from the pUNI/pHOST fusion construct.
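
The L(...)-R(...) bookkeeping and the 5% suitability criterion described above can be illustrated with a short sketch. It is a simplified model under stated assumptions, not the patent's procedure: a site is represented only by the sets of mutated positions in its left and right repeats, fusion is modelled as pairing the left repeat of the pUNI site with the right repeat of the pHOST site (and vice versa for the distal junction), and the names `LoxSite`, `fuse`, `mismatch_positions` and `is_useful_pair` are hypothetical.

```python
from typing import NamedTuple


class LoxSite(NamedTuple):
    left: frozenset   # mutated positions in the L repeat (empty = wild type)
    right: frozenset  # mutated positions in the R repeat (empty = wild type)

    def label(self) -> str:
        def fmt(positions):
            return ",".join(str(p) for p in sorted(positions)) or "0"
        return f"L({fmt(self.left)})-R({fmt(self.right)})"


def fuse(uni_site: LoxSite, host_site: LoxSite):
    """Return (promoter-proximal hybrid site, distal site) after fusion."""
    hybrid = LoxSite(uni_site.left, host_site.right)
    distal = LoxSite(host_site.left, uni_site.right)
    return hybrid, distal


def mismatch_positions(site: LoxSite) -> list:
    """Positions mutated in either repeat (each counted once)."""
    return sorted(site.left | site.right)


def is_useful_pair(mutant_rate: float, wild_type_rate: float,
                   threshold: float = 0.05) -> bool:
    """Apply the 'about 5% or greater' criterion to measured rates."""
    if wild_type_rate <= 0:
        raise ValueError("wild-type rate must be positive")
    return mutant_rate / wild_type_rate >= threshold


LOXP2 = LoxSite(frozenset({3, 6}), frozenset())
LOXP3 = LoxSite(frozenset(), frozenset({9}))

if __name__ == "__main__":
    hybrid, distal = fuse(LOXP2, LOXP3)
    print(hybrid.label(), distal.label(), mismatch_positions(hybrid))
```

The rates passed to `is_useful_pair` would come from colony counts obtained as in Ex. 4; how those counts are converted to a rate is not modelled here.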

It will be apparent to those skilled in the art that a similar
strategy can be employed for the modification of frt sites
when the FLP recombinase is employed for the recombination
event. The frt site, like lox sites, contains two 13 bp
inverted repeats separated by an 8 bp spacer region.

It is clear from the above that the present invention
provides methods for the subcloning of nucleic acid molecules
that permit the rapid transfer of a target nucleic acid
sequence (e.g., a gene of interest) from one nucleic acid
molecule to another in vitro or in vivo without the need to rely
upon restriction enzyme digestions.

All publications and patents mentioned in the above
specification are herein incorporated by reference. Various
modifications and variations of the described method and
system of the invention will be apparent to those skilled in
the art without departing from the scope and spirit of the
invention. Although the invention has been described in
connection with specific preferred embodiments, it should
be understood that the invention as claimed should not be
unduly limited to such specific embodiments. Indeed, various
modifications of the described modes for carrying out
the invention which are obvious to those skilled in molecular
biology or related fields are intended to be within the scope
of the following claims.

AATTCTOTCA OCCOTTAAOT OTTCCTOTOT CACTOAAAAT TOCTTTOAOA OOCTCTAAOO 60

OCTTCTCAOT OCOTTACATC CCTOOCTTOT TOYCCACAAC COTTAAACCV TAAAAOCTTT 120 

AAAAOCCTTA TATATTCTTT TTTTTCTTAT AAAACTTAAA ACCTTAOAOO CTATT.TAAOT l»0 

TOCTOATTTA TATTAATTTT ATTOTTCAAA CATOAOAOCT TAOTACOTOA AACATOAOAO 240
CTTAOTACOT TAOCCATOAO AOCTTAOTAC OTTAOCCATO AOCCTTTAOT TCOTTAAACA

rOACACCTTA CTACCTTAAA CATCACACCT TAOTACOTOA AACATCAOAC CTTAOTACOT 

ACTATCAACA OOTTOAACTO CTOATCAACA OATCCTCTAC OCOCCCOCOO TACCATAACT 

TCCTATACCA TACATTATAC OAACTTATCT COAATTCCCC UUCCTCCACA ACATATCCCC 

ATflfiflfiATCC GCTIOCCGCAA TTfiTTAACAfl AKCtJTCOAr OAGOTCOCTA TCAOCCTOOA 

<:tc;tc:c:<:ttc: t act t«c cac; ccavckstto tTTiiccccTc ccccctccct tccttcaccc 

TOOAAOOTCC CACTCCCACT OTrCTTTCCT AATAAAATGA GOAAATTOCA TCOCATTOTC 

T Ci A (i T A <i II T <! TCATTCTATT CT CI <I Ci Ci it CI T (1 <I CI <! T <J <* <I <I<; A (idACAOCA AO <KM1 O A <2 CI A T T 

COGAAOACAA TAOCAOOCAT OCTOCOOATT CTAOAAOATC COOCTOCTAA CAAACCCCOA 

aa<;<;aa<:c;t<; a<; ttijiic rite, toccaccoct oagcaataac taccataacc ccttccoucc 

TCTAAACOOO TCTTOAOCOO TTTTTTOCTO AAAOOAOOAA CTATATCCOO ATATCCCOOO 

c:T<;c;nc:c a ac; aactccacca tgaoatcccc <;c<x:t<;<; a«« atcatccacic c:<;c;cctc:cc?<; 

GAAAACCATT CCCAAOCCCA ACCTTTCATA OAAOCCOCCO CTOCAATCOA AATCTCOTOA 

tggcacks tt<; (iocdrccci'T (;«tc«(;tcat tt<:«aacc(m: aoac:tccc«c t c a c; a a c; a a <: i o h o 

TCCTCAAOAA OCCOATACAA OCCOATOCOC TOCOAATCOO OAGCOOCOAT ACCOTAAAOC 1140 

AfflAflfl AAflC C.C.TCAGCr.r.A TTCOCCOCCA ACCTCTTCAG CAATATCACO flOTAflCfAAC 1200 

OCTATOTCCT 0ATAOCOGTC coccacaccc aoccooccac aotcoatoaa tccaoaaaao 

OGGCCATTTT CCACCATGaT ATTCfiflCAAO CAflflCATCflC CATGflOTCAC GACGAGATGC 

TCOCCGTCOO OCATOC0COC CTTOAGCCTO OCOAACAOTT COOCTOOCOC OAOCCCCTOA 

TOCTCTTCGT CGAOATCATC C7GATCOACA AOACCOGCTT CCATCCOAGT ACGTOCTGGC 

TCGATCCOAT GTTTCGCTTO GTOGTCOAAT OOOCAOOTAO CCOGATCAAO COTATOCAOC 

CGCCGCATTG CATCAGCCAT GATGGATACT TTCTCOOCAO GAGCAAGGTG AGATGACAGG 

AGATCCTOCC CCGOCACTTC OCCCAATAOC AOCCAOTCCC TTCCCOCTTC AOTOACAACO 

TCGAGCACAG CTOCOCAAOO AACCCCCOTC GTOGCCAGCC ACOATAOCCO COCTGCCTCO 

TCCTGCAOTT CATTCAOOOC ACCOOACAOfl TCOOTCTTOA CAAAAAOAAC COOGCOCCCC 

TOCGCTGACA OCCOOAACAC OOCOOCATCA OAOCAOCCOA TTOTCTOTTO TOCCCAOTCA 

TAOCCOAATA OCCTCTCCAC CCAAGCOOCC OOAOAACCTO CGTOCAATCC ATCTTOTTCA 

ATCATOCOAA ACOATCCTCA tcctgtctct toatcaoatc ttgatcccct OCGCCATCAG 

ATCCTTOOCO OCAAOAAAOC catccaottt actttocaoo octtcccaac cttaccaoao 

<i«<:<ic:c:c<;Ac; <:k;<icaa ttc cchjttccictt gctgtccata aaaccciccca otctagctat 2040 

COCCATOTAA OCCCACTOCA AOCTACCTOC TTTCTCTTTO COCTTOCOTT TTCCCTTOTC 2100 

«:acjata<u:<:« a<i r acictoac: att c: a r c:cu<i ««ti: a«jc a<:it cit t tc to ccici actogctttc luo 

TACOTOTTCC OCTTCCTTTA CCAOCCCTTO COCCCTflAOT UCTTGCOtiCA UCUTGAAUCT 2330 

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: [illegible] base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: [illegible]
(D) TOPOLOGY: [illegible]

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:



OGATCCCCOO GAATTC



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: [illegible]
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
OOATCOCATA TOCCCATOOC TCOAOOATCC OAATTC 



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: [illegible]
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
CATCOCTATA ACTTCOTATA OCATACATTA TACOAAOTTA TO 



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: [illegible]
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

OATCCATAAC TTCOTATAAT OTATOCTATA COAAOTTATA OC 



(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: [illegible]
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

cjuccugacov cataacttcu tatacjcatac attatacoaa ottato 



(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: [illegible]
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
OATCCATAAC ttcotataat otatoctata coaaottato acotcc



(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4[illegible] base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: [illegible]
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

TCOACACOTC ataacttcot ataocataca ttatacoaao ttatoc 

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: [illegible]
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

COCCOCATAA CTTCOTATAA TOTATOCTAT ACOAAOTTAT OACCTC 

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1740 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: [illegible]
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..1737

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

ATO TCC CCT ATA CTA OOT TAT TOO AAA ATT A AO OOC CTT OTO CA4 



I « t Mar Pi 

t 



Ola T j r Trp I. j a I I • I. f • (! I f I. a a V • I (1 I * r r , 



ACT COA CTT CTT TTO OA A TAT CTT OA A OA A AAA TAT OA A OAO CAT T TO 

Tar Ar« I. «a I... I. a a Ola Tjr I. a a (I I a (1 I • I. y a Tyi Ola 41 I a III* I. • « 

SO 3S jo 

TAT OAO COC OAT O A A OOT OAT AAA TOO COA A A C AAA AAO TTT OA A TTO 

T,r <)la Ar 9 Aap. (Hi Oly Aap I. ym Trp A t $, A* a I. » • I.,. Ph. Ola I. 

* « □ 4 3 

OOT TTO OAO TTT CCC A A T CTT CCT TAT TAT ATT OAT OOT OAT OTT AAA 

O I , 1 . . Ol. r a • » » • Aaa I. a a »ra Ty* T T r Id Aap fll, Aap V.I 1. . . 

s 0 S3 6 0 

TTA A C A CAO TCT ATO OCC ATC ATA COT TAT ATA OCT OAC AAO CAC AAC 

I. a a Tfc, Ol. !t a r Mai Ala lit f|« A r a T y r II. Ala Aap I. * » HI. A . a 

0* 10 73 i0 

ATCI TTO OOT OOT TCT CCA AAA OAO COT OCA OAO ATT T C A ATO CTT OAA 

Met la. Ol , OI» C,a Fr. f. fm fl|, A r ft Ol. M, S«, Mai |. a. Ol, 
* 3 a o «s 

ooa oc:<: orr ttci oat att aoa tac: oot ott r<:o aoa att oca tat aot 

Ol f At, V.I Lea A.p II. Ai| T,« Oly V.| Sat Ac* 11. Al. Tft Sa« 

1 0 0 1 05 no 






AAA CAC 
L j m A • 9 



T TT 

r b c 

l l s 



G A A 
C I * 



ACT 
T h i 



CTC 
L e • 



AAA 



OTT 
V • | 

I 3 0 



GAT TTT CTT 
Aap Pbe Lei 



AG C A A O 
Set L y • 



CTA CCT GAA 
L 0 % Pro Gig 



A TO CTG 
Met L c • 

1 3 U 



AAA 
L J • 



A TO 
Me l 



TT C 
P b c 



O A A 

G I 9 



GAT 
A . p 

1 > 5 



COT 
A r « 



TTA TOT CAT 
Lev Cy. Hi* 



AAA ACA 

L r • Tbt 



TAT TTA A AT 
T j i Lea A • o 



GOT GAT 
Gl y A • p 



CAT 

H i • 



OTA 
V • 1 



ACC 
T h c 



CAT 
H i • 
1 S O 



CCT 

r i o 



CAC 
A • p 



TTC AT(! TTG 
P fc « Met Lee 



TAT OAC 
T y i A a p 



OCT CTT GAT 
A I i Lev Aep 



O T T CTT 
V • I V.| 



TTA 
L « » 



T A C 
Ty r 



A TCr 
U • i 
I e S 



OAC 
A . p 



CCA 
P r e 



ATO 
M« t 



TOC CTO OAT 
C 7 • Lev Aep 

1 7 0 



OCO TTC 
Ale rii 



CCA AAA TTA 
Pre Lye Leo 

I 1 5 



GTT 
V • I 



TGT 
Cy . 



TTT 
Pbe 



AAA 

Lj • 

I It O 



AAA 
Ly . 



COT 
At t 



ATT 
I t « 



O A A 
G 1 » 



OCT ATC CCA 
Ale lie Pro 



CAA ATT 
Gla lie 



OAT A A O TAC 
Aep Lye T y r 



T TO 
Lee 



AAA 
Ly • 



TCC 
S * r 
I V 5 



AOC 
S • r 



A AO 
Ly . 



TAT 
Ty r 



ATA 
I I e 



OCA 
A 1 e 
lOO 



TOO CCT TTO 
Tip Pre Lee 



CAO OOC 
O I • O 1 y 



TOO CAA OCC 
Tip Ola Ale 



ACG 
T b i 



TTT 
Pbe 

2 i a 



GGA 
O 1 y 

3 2 S 



GOT 
Ol y 



GOT 
Ol y 



OOC 
01 y 



OAC 
Aep 



TCT COT COT OCA 



TCT 

2 3 O 



CAT 
H 1 • 
2 • 3 



CCT 
P c e 



CCA AAA TCO 
P t • Lye Set 



OTT OOA 



TCO CAT ATO 

2 3 5 



OAT CTO 

Alp Lee 

3 3 O 

CCC ATO 



OTT CCC COT 
Vet Pi» Ac s 



OCC AAT TTA 
2 4 0 



CTO 
Lev 



ACC 

T b T 



OTA 
V ft I 



CAC 
11 1 e 



CAA 
<i I m 
3 4 5 



AAT 
A » e 



TTO 
Lee 



CCT 
Pre 



OCA TTA CCO 
Ale Lev P r • 

15 0 



OTC OAT 
V • I Aep 



OCA ACO AOT 
Ale V b t S*r 

2 S S 



GAT GAG OTT 



COC 
2 6 0 



A AO AAC CTG ATO 



OAC ATO TTC 
2 6 S 



AGO OAT 



COC CAO OCO 
3 7 0 



TTT TCT 
P h e Her 



GAG 
(I I « 

2 7 S 



CAT 
II I • 



ACC 
Tb r 



TOO 
T r p 



AAA 



ATO 
Me t 

2*0 



CTT CTO TCC 
I. • ■ Lee !t e r 



OTT TOC 
V e I <: y e 
2*5 



COO TCO TOO 
A r £ Scr Trp 



GCG GCA 
Ale Ale 



TGC 
T r p 



TOC 
C y . 



A AG 
l.y . 



TTG 
I. e s 



AAT 
A ft ■ 

29 S 



AAC 
A m m 



CCO AAA TOG 
Arg 7 * Tf p 



TTT CCC 
Pk« Pre 

3 0 0 



GCA GAA CCT 
Ale Gift fro 



GAA OAT 
<; 1 m Asp 



OTA AAA 
V ft I |. j * 



COO TCC 

A r ft S • t 



OTT 
V ft I 



ACT 

T h r 



OOO 

€t I y 



COC 
A r a 



ATC 
I I e 



CTO 
I. * e 
3 4 O 



OAT 
A . p 



CAO 
<! t e 
3 3 3 



TAT 
T y r 
3 1 0 

CAA 
(1 I e 



COA 
A r q 



CTT 
I. e e 



CAT 
II I ■ 



CCA 



CTA 
Lev 



TTO 
I. • m 



AOT 
3 e r 



TAT CTT CAO 
Tyr |. e e Ci 1 e 



OOC CAO CTA 

<l I 7 (I I t I. e • 
J 3 0 

OAC AOC AAT 

Aap Set A*e 



OCO COC 
Ale Arf 



AAC ATO 
A a e Met 



OCT OTT 
A I • V a I 



GOT CTO OCA 



<i i y i. e * 



A I e 

3 3 0 



CTT CAT COT 

Lit II 1 ft Artf 
3 3 9 

TCA CTO OTT 

Her l.es V.I 
3 3 0 



ATO COO 
Met Are 



COO 
A r a 
3 5 3 



A TC 
I f e 



COA 
A . ., 



AAA 

I. y • 



OA A 
CI I e 



AAC 
A ■ e 
3*0 



OTT OAT OCC 
V • | Aep Ale 



UOT OA A 
.« I y CI I e 



COT OCA AAA 
A r ft A I i I- * • 



CAC OCT 
O I ■ Ala 



CTA 
I. m m 



G CO 
A I e 



TTC 
Pee 



CAA 
O I • 



COC 
A r ft 
3 7 3 



ACT 
T h r 



CAT TTC OAC 
Aep P h « Aap 



CAO OTT 
O I e Ve I 



COT TCA CTC 
A r ft Ser I. • a 



ATC CAA 
Mel O I ■ 



««<; ATT 
O I y f I . 



ACJCJ (ITT 
A r ft V.I 



AAT 
A a a 



OCT 
A I • 



AAA 
Ly • 



AOC 
Mar 



TAT 



OAT 
A • w 

4 2 0 



OAT 
A . p 



AAC 
A . • 

4 0 3 



ATC 
I I • 



COC 
* r II 
3*0 

ACC 
T ft. r 



TCA 
S m t 



TOC 
Cy . 



CTO 
I. e a 



COT 
Ar « 



CAO 
O I e 



TTA 
I. « e 



A C T 
Tb c 



OAT ATA COT 

Aap lie Are; 

3*3 

COT ATA (ICC 

A r A lie Ala 
4 • 0 

CAC (HIT ()()(! 

Aep O I y O I 7 
4 3 5 



AAT CTO 
A e a I. e a 



OA A ATT 
Ola lie 



AO A ATO 
A r ft Ml I 



OCA TTT CTO 

Ale Pbe U« 
4 0 0 

(ICC A (1 O AT C 

A I ■ A r ft r I « 
4 I 3 

TTA ATC CAT 

L • • (le Hi* 
ATT GGC AGA ACG AAA ACG CTG GTT AGC ACC GCA GGT GTA GAG AAG GCA
Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu Lys Ala
        435                 440                 445

CTT AGC CTG GGG GTA ACT AAA CTG GTC GAG CGA TGG ATT TCC GTC TCT
Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser Val Ser
    450                 455                 460

GGT GTA GCT GAT GAT CCG AAT AAC TAC CTG TTT TGC CGG GTC AGA AAA
Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val Arg Lys
465                 470                 475                 480

AAT GGT GTT GCC GCG CCA TCT GCC ACC AGC CAG CTA TCA ACT CGC GCC
Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr Arg Ala
                485                 490                 495

CTG GAA GGG ATT TTT GAA GCA ACT CAT CGA TTG ATT TAC GGC GCT AAG
Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly Ala Lys
            500                 505                 510



GAT GAC TCT GGT CAG AGA TAC CTG GCC TGG TCT GGA CAC AGT GCC CGT
Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser Ala Arg
        515                 520                 525

GTC GGA GCC GCG CGA GAT ATG GCC CGC GCT GGA GTT TCA ATA CCG GAG
Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile Pro Glu
    530                 535                 540

ATC ATG CAA GCT GGT GGC TGG ACC AAT GTA AAT ATT GTC ATG AAC TAT
Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met Asn Tyr
545                 550                 555                 560

ATC CGT AAC CTG GAT AGT GAA ACA GGG GCA ATG GTG CGC CTG CTG GAA
Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu Leu Glu
                565                 570                 575



GAT GGC GAT TAG
Asp Gly Asp
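
Because each codon row in the listing is paired with a translation row, the two can be cross-checked mechanically, which is useful when the text has passed through character recognition. The following is a minimal sketch using the standard genetic code. The function name `translate` and the example input (the last four codons of the coding sequence, read here as GAT GGC GAT TAG) are illustrative assumptions and are not part of the patent.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter amino acids.

    Stop codons are reported as '*'; trailing bases that do not fill a
    complete codon are ignored.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Assumed reading of the final codon row of the listing.
    print(translate("GAT GGC GAT TAG"))  # DGD*
```

A codon row whose translation disagrees with the printed amino acid row points to a misread base (or a misread residue) in that row.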



(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 579 amino acids
(B) TYPE: amino acid

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro
1               5                   10                  15

Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu
        35                  40                  45

Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly Asp Val Lys
    50                  55                  60

Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala Asp Lys His Asn
65                  70                  75                  80

Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu
                85                  90                  95

Gly Ala Val Leu Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser
            100                 105                 110

Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu
        115                 120                 125

Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
    130                 135                 140
Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
                165                 170                 175

Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp Lys Tyr
            180                 185                 190

Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln Gly Trp Gln Ala
        195                 200                 205

Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Val Pro Arg
    210                 215                 220

Gly Ser Arg Arg Ala Ser Val Gly Ser His Met Pro Met Ala Asn Leu
225                 230                 235                 240

Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala Thr Ser
                245                 250                 255

Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg Gln Ala
            260                 265                 270

Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg Ser Trp
        275                 280                 285

Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala Glu Pro
    290                 295                 300

Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly Leu Ala
305                 310                 315                 320

Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu His Arg
                325                 330                 335

Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser Leu Val
            340                 345                 350

Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg Ala Lys
        355                 360                 365

Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg Ser Leu
    370                 375                 380

Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala Phe Leu
385                 390                 395                 400

Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala Arg Ile
                405                 410                 415

Arg Val
5,851,808



(2) INFORMATION FOR SEQ ID NO:12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

ATAACTTCGT ATAGCATACA TTATACGAAG TTAT
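
The 34 base pair loxP site is commonly described as two 13 base pair inverted repeats flanking an 8 base pair spacer. The sketch below checks that architecture for a sequence typed in from the listing. It assumes the reading of SEQ ID NO:12 as ATAACTTCGT ATAGCATACA TTATACGAAG TTAT, and the helper names `split_site` and `arms_are_inverted_repeats` are hypothetical names introduced only for this illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def split_site(site: str) -> tuple[str, str, str]:
    """Split a 34 bp site into (left arm, spacer, right arm) as 13 + 8 + 13."""
    site = site.replace(" ", "").upper()
    if len(site) != 34:
        raise ValueError(f"expected 34 bases, got {len(site)}")
    return site[:13], site[13:21], site[21:]


def arms_are_inverted_repeats(site: str) -> bool:
    """True when the right arm is the reverse complement of the left arm."""
    left, _spacer, right = split_site(site)
    return reverse_complement(left) == right


if __name__ == "__main__":
    loxp = "ATAACTTCGT ATAGCATACA TTATACGAAG TTAT"  # assumed reading of SEQ ID NO:12
    print(split_site(loxp))
    print(arms_are_inverted_repeats(loxp))
```

Variant sites in the listing that differ from loxP only in the spacer should still pass the arm check, while sites with a changed arm will not, so a failure is not by itself evidence of a transcription error.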



(2) INFORMATION FOR SEQ ID NO:13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:



(2) INFORMATION FOR SEQ ID NO:14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

ATAACTTCGT ATAGCATACA TTATATGAAG TTAT



(2) INFORMATION FOR SEQ ID NO:15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

ATTACCTCGT ATAGCATACA TTATATGAAG TTAT



(2) INFORMATION FOR SEQ ID NO:16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:

ATAACTTCGT ATAGTATACA TTATACGAAG TTAT
(2) INFORMATION FOR SEQ ID NO:17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

ACAACTTCGT ATAATGTATG CTATACGAAG TTAT



(2) INFORMATION FOR SEQ ID NO:18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:

GAAGTTCCTA TTCTCTAGAA AGTATAGGAA CTTC



We claim: 

1. A pUNI vector construct comprising, in 5' to 3' operable
order:

a) a conditional origin of replication;

b) a sequence-specific recombinase target site having a 5'
and a 3' end; and

c) a unique restriction enzyme site, said restriction
enzyme site located adjacent to said 3' end of said
sequence-specific recombinase target site.

2. The vector construct of claim 1 further comprising a
prokaryotic termination sequence located 3' to said unique
restriction enzyme site.

3. The vector construct of claim 2, wherein said prokaryotic
termination sequence is the T1 termination sequence.

4. The vector construct of claim 1 further comprising a
eukaryotic polyadenylation sequence located 3' and adjacent
to said unique restriction enzyme site.

5. The vector construct of claim 4, wherein said
polyadenylation sequence is selected from the group consisting of
the bovine growth hormone polyadenylation sequence, the
simian virus 40 polyadenylation sequence and the Herpes
simplex virus thymidine kinase polyadenylation sequence.

6. The vector construct of claim 1 further comprising a 
selectable marker gene. 

7. The vector construct of claim 6, wherein said selectable
marker is selected from the group consisting of kanamycin
resistance gene, the ampicillin resistance gene, the
tetracycline resistance gene, the chloramphenicol resistance gene,
the streptomycin resistance gene, the strA gene and the sacB
gene.

8. The vector construct of claim 1, wherein said
sequence-specific recombinase target site is selected from the group
consisting of loxP, loxP2, loxP3, loxP23, loxP511, loxB,
loxC2, loxL, loxR, loxΔ86, loxΔ117, frt, dif and att.

9. The vector construct of claim 1 further comprising a
gene of interest inserted into said unique restriction enzyme
site.
10. The vector construct of claim 1, wherein said vector
construct is pUNI-10, comprising the nucleotide sequence
set forth in SEQ ID NO:1.

11. A pUNI vector construct comprising, in 5' to 3'
operable order:

a) a conditional origin of replication;

b) a sequence-specific recombinase target site; and

c) a polylinker, said polylinker located adjacent to said 3'
end of said sequence-specific recombinase target site.

12. A pHOST expression vector construct comprising in 5'
to 3' operable order:

a) an origin of replication;

b) a promoter element; and

c) a sequence-specific recombinase target site.

13. The expression vector construct of claim 12 further
comprising a selectable marker gene.

14. A fused expression vector construct produced by the
recombination of a pUNI vector and a pHOST expression
vector, said fused expression vector comprising in 5' to 3'
operable order:

a) a promoter element;

b) a first sequence-specific recombinase target site having
a 5' and a 3' end;

c) a gene of interest joined to said 3' end of said
sequence-specific recombinase target site such that a functional
translational reading frame is created;

d) a conditional origin of replication;

e) a first selectable marker gene;

f) a second sequence-specific recombinase target site; and

g) an origin of replication.

15. The fused expression vector construct of claim 14
further comprising a second selectable marker gene.

16. A method for the in vitro recombination of a pUNI
vector and a pHOST expression vector to form a fused
expression vector, said method comprising:

a) providing:

i) a pUNI vector construct comprising, in 5' to 3'
operable order, a conditional origin of replication, a
first sequence-specific recombinase target site and a
first selectable marker gene;

ii) a pHOST expression vector construct comprising, in
5' to 3' operable order, an origin of replication, a
promoter element and a second sequence-specific
recombinase target site; and

iii) a site-specific recombinase;

b) contacting said pUNI vector and pHOST expression
vector constructs in vitro with said site-specific
recombinase under conditions such that said first and second
constructs are recombined to form a single fused
expression vector.

17. The method of claim 16, wherein said pUNI vector
construct further comprises a gene of interest located
adjacent to said 3' end of said first sequence-specific
recombinase target site and before the selectable marker gene,
wherein the recombination of said pUNI vector and pHOST
expression vector constructs places said gene of interest
under the transcriptional control of said promoter element
within said fused expression vector.
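
The outcome described in claims 16 and 17 can be pictured with a small model in which each circular plasmid is a list of named elements read 5' to 3'. The sketch below is an illustration only: the element names, the function names `rotate_to` and `fuse`, and the assumption that both target sites lie in the same orientation are introduced here for the example and are not taken from the patent.

```python
def rotate_to(circle: list[str], element: str) -> list[str]:
    """Rotate a circular element list so that it starts at `element`."""
    i = circle.index(element)
    return circle[i:] + circle[:i]


def fuse(puni: list[str], phost: list[str], site: str = "lox") -> list[str]:
    """Model one recombination event between two circles that each carry one site.

    Both circles are opened at their site and joined, giving a single
    larger circle that carries two copies of the site.
    """
    if puni.count(site) != 1 or phost.count(site) != 1:
        raise ValueError("each plasmid must carry exactly one target site")
    u = rotate_to(puni, site)    # [site, elements downstream of the pUNI site...]
    h = rotate_to(phost, site)   # [site, elements downstream of the pHOST site...]
    return h[1:] + [site] + u[1:] + [site]


if __name__ == "__main__":
    # Hypothetical element orders, following the wording of claims 16 and 17.
    p_uni = ["conditional_ori", "lox", "gene_of_interest", "marker_1"]
    p_host = ["ori", "promoter", "lox"]
    print(fuse(p_uni, p_host))
```

In the fused list the promoter is followed by a target site and then by the gene of interest, which is the arrangement that places the gene under the transcriptional control of the pHOST promoter.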

18. The method of claim 17, wherein said pHOST expression
vector construct further comprises a nucleotide
sequence encoding an affinity domain and the recombination
of said pUNI vector and pHOST expression vector constructs
results in placing said gene of interest in frame with said
sequence encoding said affinity domain, thereby encoding a
fusion protein having the affinity domain located at either the
amino- or carboxy-terminus of said fusion protein.
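
When the affinity domain lies upstream of the recombinase target site, an in-frame fusion has to be translated through that site, so the site must be free of stop codons in the frame used and the junction length must keep the reading frame. The sketch below checks both points for a 34 base pair site. It assumes the reading of SEQ ID NO:12 given as `LOXP`, and the function names `open_frames` and `padding_for_frame` are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def open_frames(site: str) -> list[int]:
    """Return the forward reading frames (0, 1, 2) that contain no stop codon."""
    site = site.replace(" ", "").upper()
    frames = []
    for frame in range(3):
        codons = [site[i:i + 3] for i in range(frame, len(site) - 2, 3)]
        if not any(codon in STOP_CODONS for codon in codons):
            frames.append(frame)
    return frames


def padding_for_frame(junction_length: int) -> int:
    """Number of extra bases needed to make the junction a multiple of three."""
    return (-junction_length) % 3


if __name__ == "__main__":
    LOXP = "ATAACTTCGT ATAGCATACA TTATACGAAG TTAT"  # assumed reading of SEQ ID NO:12
    print(open_frames(LOXP))
    print(padding_for_frame(len(LOXP.replace(" ", ""))))
```

The check covers only the forward strand of the site as written. A real construct would also need the frame of the surrounding linker bases to be verified against the actual vector sequence.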

19. The method of claim 16, wherein said pHOST vector
construct further comprises a second selectable marker gene,
said second selectable marker being different from said first
selectable marker.

20. A method for the recombination of pUNI vector and
pHOST expression vector constructs to form a single fused
expression vector in a prokaryotic host, comprising:
a) providing:

i) a pUNI vector construct comprising a conditional
origin of replication, a first sequence-specific
recombinase target site having a 5' and a 3' end, a unique
restriction enzyme site, said restriction enzyme site
located adjacent to said 3' end of said
sequence-specific recombinase target site, and a first selectable
marker gene;

ii) a pHOST expression vector construct comprising in
5' to 3' operable order, an origin of replication, a
promoter element having a 5' and a 3' end and a
second sequence-specific recombinase target site
having a 5' and a 3' end;

iii) a prokaryotic host cell expressing a site-specific
recombinase;

b) introducing said pUNI vector and pHOST expression
vector constructs into said prokaryotic host cell,
wherein said host cell expresses a site-specific
recombinase, under conditions such that said pUNI
vector and pHOST expression vector constructs are
recombined to form a single fused expression vector.

21. The method of claim 20 further comprising growing
said host cell containing said single fused expression vector
under conditions which select for the presence of said single
fused expression vector.
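
The selection step in claim 21 can be reasoned about with a simple rule: a plasmid persists only if the host can use at least one of its origins and the plasmid carries the marker being selected. The sketch below encodes that rule. It assumes, for illustration, that selection is carried out in a host that cannot support the conditional origin and that the first selectable marker is the one selected; the class `Plasmid`, the function `survives`, and all element names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    origins: frozenset[str]
    markers: frozenset[str]


def survives(plasmid: Plasmid, host_supports: set[str], selection: str) -> bool:
    """True if the plasmid can replicate in the host and carries the selected marker."""
    replicates = any(origin in host_supports for origin in plasmid.origins)
    return replicates and selection in plasmid.markers


if __name__ == "__main__":
    p_uni = Plasmid("pUNI", frozenset({"conditional_ori"}), frozenset({"marker_1"}))
    p_host = Plasmid("pHOST", frozenset({"ori"}), frozenset({"marker_2"}))
    fused = Plasmid("fused", p_uni.origins | p_host.origins, p_uni.markers | p_host.markers)

    # Assumed selection host: supports only the ordinary origin.
    for plasmid in (p_uni, p_host, fused):
        print(plasmid.name, survives(plasmid, {"ori"}, "marker_1"))
```

Under these assumptions only the fused vector passes, since it combines the replicating origin contributed by pHOST with the marker contributed by pUNI.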

22. The method of claim 20, wherein said pUNI vector 
construct further comprises a prokaryotic termination 
sequence. 

23. The method of claim 20, wherein said pUNI vector
construct further comprises a gene of interest inserted into
said unique restriction enzyme site.

24. The method of claim 20, wherein said pUNI vector
construct further comprises a eukaryotic polyadenylation
sequence.



